THE BULLETIN OF
MATHEMATICAL BIOPHYSICS

SEPTEMBER 1960

Kinetics of Ultrafiltration Hemodialysis. Robert I. Macey and A. V. Wolf . . . 217
A Quantum-Theoretic Approach to Genetic Problems. Robert Rosen . . . 227
Further Contributions to the Mathematical Biophysics of Automobile Driving. N. Rashevsky . . . 257
A Contribution to the Mathematical Biology of Automobile Driving: II. Passing as a Case of Psychophysical Discrimination. N. Rashevsky . . . 263
[Entry illegible in source] . . . 269
Molecular Set Theory: A Mathematical Representation for Chemical Reaction Mechanisms. Anthony F. Bartholomay . . . 285

THE UNIVERSITY OF CHICAGO PRESS, CHICAGO


The Bulletin of 
MATHEMATICAL BIOPHYSICS 


Editor:
N. RASHEVSKY, UNIVERSITY OF CHICAGO
Associate Editors:
H. D. LANDAHL, UNIVERSITY OF CHICAGO 
ANATOL RAPOPORT, UNIVERSITY OF MICHIGAN 


The BULLETIN is devoted to publications of research in Mathematical 
Biology, as described on the inside back cover. 


AUTHORS: 


Please read carefully the inside back cover. 


Published March, June, September, and December. Subscription
rates per volume: U.S.A., $12.00; Canada and Pan America, $12.50;
all other countries in Postal Union, $13.00. Single copies: $4.00.
Checks should be made payable to The University of Chicago Press,
5750 Ellis Avenue, Chicago 37, Illinois. Subscriptions will be
entered to start with the first issue to be published after receipt of
order. Editorial correspondence: The Editor, The Bulletin of
Mathematical Biophysics, 5741 Drexel Avenue, Chicago 37, Illinois.
Subscribers are requested to notify the Press and their local
postmaster immediately of change of address, giving both the old and
the new address.


Second-class postage paid at Lancaster, Pennsylvania. 


[Copyright 1960 by the University of Chicago] 


BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 22, 1960 


KINETICS OF ULTRAFILTRATION HEMODIALYSIS 


ROBERT I. MACEY* AND A. V. WOLF 


DEPARTMENT OF PHYSIOLOGY 
UNIVERSITY OF ILLINOIS COLLEGE OF MEDICINE 
CHICAGO, ILLINOIS 


The expressions of Wolf et al. (1951) and Renkin (1956) for the
kinetics of artificial kidneys are generalized to include the effects of
filtration. If B is the bath volume, b the relevant volume of distribution,
f the filtration rate, t the time, and A_0, B_0, b_0 represent A, B, and b at
time t = 0, then the plasma concentration A is given by

$$\frac{A}{A_0} = \frac{B_0}{B_0 + b_0}\,\exp\!\left[-\frac{(B_0 + b_0)\,D_f\,K\,t}{B_0 b_0}\right] + \frac{b_0}{B_0 + b_0}$$

where D_f is a rate constant, and K(ft) is given by

$$K(ft) = \frac{1}{B_0 + b_0}\sum_{M=0}^{\infty}\frac{B_0^{M+1} + (-1)^M b_0^{M+1}}{(M+1)\,B_0^M b_0^M}\,(ft)^M = 1 + \frac{B_0 - b_0}{2 B_0 b_0}\,ft + \dots$$

Letting a_0 denote the flow rate at the entrance to the dialysis tubing, f
the net rate of filtration, P the permeability, S the surface area, V the
volume of fluid in the dialysis tubing, and t_c the circulation time of the
fluid through the dialysis tubing, then D_f is given by

$$D_f = (a_0 - f)\left(1 - e^{-PS\,t_c/V}\right)$$

When f = 0, D_f reduces to the ordinary dialysance constant.

These corrections to the filtration-free case are small when con- 
sidering an adult male body volume together with the parameters of pres- 
ent day artificial kidneys. 


Introduction. Recent advances in artificial kidney techniques 
have seen the introduction of relatively high blood pressures into 


*Present address Department of Physiology, University of California, 
Berkeley 4, California. 




the dialysis unit in an effort to control body fluid volume as well 
as composition. The effects of thus superposing filtration upon 
the ordinary diffusion process must be accounted for in any kinetic 
analysis of the system. 

In this paper we attempt to generalize expressions previously
developed by Wolf et al. (1951) and Renkin (1956) to include
filtration effects.

Kinetics with Zero Filtration. Consider a solute existing at a
concentration A uniformly dispersed within a body volume, b. The
ability of artificial kidneys to clear this solute from the body has
been characterized by the introduction of a parameter D called the
dialysance (Wolf et al., loc. cit.). The dialysance is defined as
the rate of loss of solute by the body per unit concentration
gradient between the body and the fluid bathing the dialysis unit, i.e.
in terms of the bath concentration U,

$$\frac{d(bA)}{dt} = -D\,(A - U) \tag{1}$$

where t denotes time.
If all of the solute is initially contained in the body in an amount
b_0 A_0, and if the bath volume is denoted by B, then material
balance requires that

$$b_0 A_0 = bA + BU \tag{2}$$

Assuming D to be time independent, (1) can be integrated with the
aid of (2) to give

$$\frac{A}{A_0} = \frac{B_0}{B_0 + b_0}\,\exp\!\left[-\frac{(B_0 + b_0)\,D\,t}{B_0 b_0}\right] + \frac{b_0}{B_0 + b_0} \tag{3}$$

Expression (3) has been verified experimentally (Wolf et al.,
loc. cit.; Black, 1959). The value of D depends on the magnitude
of flow through the tubing and on the permeability and surface area
of the tubing. Letting a represent the flow, P the permeability,
and S the surface area, Renkin (loc. cit.) has shown

$$D = a\left(1 - e^{-PS/a}\right) = a\left(1 - e^{-PS\,t_c/V}\right), \tag{4}$$

where t_c = V/a, t_c being the circulation time required to transmit
fluid from the arterial to the venous end of the dialysis tubing.
Expression (4) also has been verified by experiment.
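The following is a minimal numerical sketch of expressions (3) and (4), not part of the original paper. It assumes D = a(1 - exp(-PS/a)) and A/A_0 = B_0/(B_0 + b_0) exp[-(B_0 + b_0)Dt/(B_0 b_0)] + b_0/(B_0 + b_0). The function names are hypothetical, and the parameter values are borrowed from the numerical example given later in the paper.

```python
import math

def dialysance(flow, perm_area):
    """Renkin's expression (4): D = a * (1 - exp(-PS / a))."""
    return flow * (1.0 - math.exp(-perm_area / flow))

def plasma_ratio(t, D, B0, b0):
    """Expression (3): A/A0 for constant bath volume B0 and body volume b0."""
    total = B0 + b0
    rate = total * D / (B0 * b0)          # exponential rate constant, 1/min
    return (B0 / total) * math.exp(-rate * t) + b0 / total

# Illustrative values (cc/min and cc) taken from the paper's later example.
D = dialysance(flow=300.0, perm_area=400.0)
ratio_6h = plasma_ratio(t=360.0, D=D, B0=1.0e5, b0=4.2e4)
print(round(D), ratio_6h)
```

At t = 0 the ratio is 1, and for long times it tends to the equilibrium value b_0/(B_0 + b_0), which is a quick sanity check on any implementation of (3).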

The assumption of a quasi-stationary state is necessary in the 
derivation of (4). More specifically, it must be assumed that the 
change in arterial concentration during the time interval required 
for a single circulation through the dialysis unit is negligible. In 
what is to follow, we shall make a similar restriction. Thus, for 
example, the present analysis might not apply so well to the in 
vitro experiments of Maher, Schreiner and Marc-Aurele (1959). 

The Dialysance during Ultrafiltration Dialysis. Equation (4)
shows that for the case of zero filtration D is independent of time.
However, in principle, as soon as a nonzero filtration is introduced,
(4) is no longer applicable, and it can be seen that rigorous
adherence to expression (1) implies that D is time dependent. For
example, as time increases, the denominator in (1), A - U, will
approach zero but the numerator, d(bA)/dt, will not, because there
will always be a nonzero filtration term. As a result we can
expect D to become infinite as t becomes infinite. This suggests
that in the presence of filtration, the dialysance defined by (1)
loses its utility. However it will be seen that a new rate
constant D_f can be defined which bears a formal resemblance to D (cf.
equation 19), and differs only slightly in magnitude from D when f
is small.

The Kinetics of Ultrafiltration Dialysis. We make the usual as- 
sumptions that composition is uniform within any cross section of 
the tube, and that diffusion in the axial direction can be ignored. 
Further we assume perfect mixing within the bath and the body, and 
that the shape of the tubing remains uniform throughout its length. 

In order to apply the mass continuity principle, we must first
calculate the net solute flux through the membrane per unit
membrane surface area. Let z denote the coordinate in the axial
direction, i.e. z is the distance from the origin (arterial end) of the
dialysis tubing to any point within the tubing. In general the solute
flux through the membrane will be dependent on z and we shall
denote it by J(z). Further, our assumptions imply that the
concentration C of solute within the dialysis tubing depends only on z
and t, and that outside the tubing (in the bath) the concentration U
depends only on t. However, for any particular value of z there will
be a variation in concentration as we pass through the wall of the
tubing from the inner surface contiguous with the circulating body
fluids to the outer surface contiguous with the bath. In other words,
letting y denote the perpendicular distance from the inner surface
to any point within the wall of the tubing, and letting δ denote the
thickness of the wall and C' the concentration within the wall, we
have at y = 0, C' = C; and at y = δ, C' = U. In addition, if W is the
diffusion coefficient within the wall and φ the rate of filtration per
unit area of dialysis membrane, we have

$$J = -W\,\frac{dC'}{dy} + \varphi\,C'$$

where J is the solute flux per unit area through the dialysis
membrane.

The specific forces involved in causing the filtration φ will
remain unspecified. Making the standard assumption that J is
independent of y and using the boundary condition C' = C at y = 0,
this equation can be integrated to read

$$C' = C\,e^{\varphi y/W} + \frac{J}{\varphi}\left[1 - e^{\varphi y/W}\right]$$

Setting C' = U at y = δ and solving for J we arrive at

$$J = \varphi\,\frac{C\,e^{\varphi\delta/W} - U}{e^{\varphi\delta/W} - 1}$$

If φδ/W is sufficiently small, e^{φδ/W} ≈ 1 + φδ/W and, writing
P = W/δ for the permeability,

$$J(z) = P\,[C(z) - U] + \varphi\,C(z) \tag{5}$$


where it is understood that J, C and U are to be considered
functions of t as well as z. In an effort to justify the above
approximation the order of magnitude of φδ/W can be estimated from the
following typical figures for the permeability P, surface area S and
total net filtration f: P ≈ 0.02 cm./min. (urea), S = 2.0 × 10^4
cm², and f < 10 cc/min. If the filtration is uniformly distributed
throughout the membrane surface, then φ = f/S < 5 × 10^-4 cm./min.
and φδ/W = φ/P < 2.5 × 10^-2.

It is likely that the filtration is not uniformly distributed and
consequently we would have underestimated φδ/W in those areas
with particularly high filtrations. However, since the estimated
order of magnitude of φδ/W is very small, the approximation

$$e^{\varphi\delta/W} \approx 1 + \frac{\varphi\delta}{W}$$

would probably still be valid. For example, if the filtration rate
varied linearly along the length of the tubing, the maximum value
of φδ/W would still be less than 5 × 10^-2.
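As a hedged illustration of how good this linearization is, the sketch below compares the exact wall flux, J = φ(C e^x - U)/(e^x - 1) with x = φδ/W = φ/P, against approximation (5), J = P(C - U) + φC. The concentrations C and U are hypothetical values in arbitrary units; P, S and f are the typical figures quoted in the text.

```python
import math

def flux_exact(C, U, phi, P):
    """Flux from the integrated wall equation, with x = phi*delta/W = phi/P."""
    x = phi / P
    return phi * (C * math.exp(x) - U) / math.expm1(x)

def flux_linear(C, U, phi, P):
    """Approximation (5): J = P*(C - U) + phi*C."""
    return P * (C - U) + phi * C

P = 0.02             # cm/min, urea permeability quoted in the text
S = 2.0e4            # cm^2, membrane surface area quoted in the text
f = 10.0             # cc/min, upper figure for total filtration
phi = f / S          # filtration per unit area, cm/min
C, U = 1.0, 0.25     # hypothetical concentrations (arbitrary units)

exact = flux_exact(C, U, phi, P)
approx = flux_linear(C, U, phi, P)
rel_err = abs(approx - exact) / exact
print(phi / P, rel_err)
```

With these figures the relative discrepancy is of the order of x/2, i.e. roughly one percent, which is in line with the claim that the approximation is acceptable at present filtration rates.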


Having calculated J, we now apply the mass continuity principle
to the solute existing at concentration C in the small volume
element ΔV within the tubing between z and z + Δz whose surface
area is ΔS. Letting a(z) denote the magnitude of fluid flow in the
axial direction within the tubing, we have

$$\Delta V\,\frac{\partial C}{\partial t} = a(z)\,C(z) - a(z + \Delta z)\,C(z + \Delta z) - J(z)\,\Delta S \tag{6}$$

The left hand term of (6) represents the rate of accumulation of
solute in ΔV; the first term on the right represents the rate of
solute flow into ΔV; the next term represents the rate of flow of
solute out of ΔV and J(z)ΔS represents the exchange taking place
between ΔV and the bath. Incorporating (5) into (6), and noting
that if L represents the total length of the tubing and σ the cross
sectional area of ΔV, then ΔS = (S/L)Δz, ΔV = σΔz, and we
have upon taking the limit as Δz → 0,

$$\sigma\,\frac{\partial C}{\partial t} = -\frac{\partial (aC)}{\partial z} - \frac{PS}{L}\,(C - U) - \frac{S}{L}\,\varphi\,C \tag{7}$$


The flow gradient ∂a/∂z can be related to the filtration in a similar
way by applying the mass continuity principle to the fluid in ΔV
with the result that

$$\frac{\partial a}{\partial z} = -\frac{S}{L}\,\varphi \tag{8}$$

Using (8) to eliminate ∂a/∂z from (7), we obtain

$$\sigma\,\frac{\partial C}{\partial t} + a\,\frac{\partial C}{\partial z} = -\frac{PS}{L}\,(C - U) \tag{9}$$

Applying the same considerations to bA, the total mass of solute
within the body volume b, and letting a_0 = a(0), and a_L = a(L), we
have

$$\frac{d(bA)}{dt} = a_L\,C(L,t) - a_0\,C(0,t). \tag{10}$$


Finally, if the solute were confined initially entirely to the body
space in an amount b_0 A_0, then conservation of mass applied to the
entire system requires that

$$b_0 A_0 = bA + BU \tag{11}$$


where B denotes the bath volume. 

Expressions (9), (10) and (11) are to be solved simultaneously 
with the conditions C(0,t) =A, C(z,0)=0, and U(0)=0. This 
process will be simplified if it is assumed that the changes in 
body and bath concentrations A and U take place sufficiently slowly 
that the solute within the dialysis tubing is practically in a steady 
state at all times. This assumption is suggested by the fact that 
both the body and bath volumes are large compared to the total 
tubing volume V. Further, as indicated above, this quasi-steady 
state assumption is implicit in Renkin’s derivation of (4) for the 
zero filtration case, and this expression fits appropriate experi- 
mental data. It appears then, that unless the filtration rate is very 
large, the assumption is also valid in our case, and accordingly we
set ∂C/∂t = 0 in (9) and write the integrated form of (9) as

$$C(z) - U = (A - U)\,e^{-PS\,T(z)/V} \tag{12}$$

The delay time T(z) is given by

$$T(z) = \sigma \int_0^z \frac{d\xi}{a(\xi)} \tag{13}$$

Further, with ∂C/∂t = 0, multiplying (7) by dz and integrating
between z = 0 and z = L yields

$$a_L\,C(L,t) - a_0\,C(0,t) + \frac{PS}{L}\int_0^L (C - U)\,dz + \frac{S}{L}\int_0^L C\,\varphi\,dz = 0. \tag{14}$$


It is convenient at this point to introduce a cumulative filtration
function Φ(z) by the relation

$$\Phi(z) = \frac{S}{L}\int_0^z \varphi\,d\xi \tag{15}$$

and note from (8) and (15) that

$$\int_0^L \frac{\partial a}{\partial z}\,dz = a_L - a_0 = -\Phi(L) = -f \tag{16}$$

where f is the total filtration rate of the entire dialysis unit.

Expressions (10), (12), (14), (15) and (16) can then be combined
to give

$$\frac{d(bA)}{dt} = -\left[\frac{PS}{L}\int_0^L e^{-PS\,T(z)/V}\,dz + \frac{S}{L}\int_0^L \varphi\,e^{-PS\,T(z)/V}\,dz - f\right](A - U) - fA \tag{17}$$

If the second integral above is integrated by parts with the aid
of (8) and (13), the expression can be simplified to

$$\frac{d(bA)}{dt} = -\left[(a_0 - f)\left(1 - e^{-PS\,t_c/V}\right)\right](A - U) - fA \tag{18}$$

where f = Φ(L) and t_c = T(L). Comparing (18) with (4) we see that
the quantity in the square brackets bears a resemblance to D.


Calling this quantity D_f, i.e.,

$$D_f = (a_0 - f)\left(1 - e^{-PS\,t_c/V}\right) \tag{19}$$

we note that when f = 0 with a constant flow a, t_c = V/a and D_f =
D. Under most circumstances f is independent of time, and hence
t_c and D_f will also be time independent.
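The reduction of the integral expression to the closed form D_f = (a_0 - f)(1 - exp(-PS t_c/V)) can be checked numerically. The sketch below is an illustration only: it assumes filtration distributed uniformly along the tubing, so that a(z) = a_0 - fz/L and (S/L)φ = f/L, and it uses a hand-written Simpson rule. The tubing length L is arbitrary, and the remaining figures are the illustrative values used later in the paper.

```python
import math

def simpson(g, lo, hi, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = g(lo) + g(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(lo + i * h)
    return total * h / 3.0

# Illustrative parameters (cc/min, cc); L is an arbitrary tubing length.
a0, f, PS, V, L = 300.0, 10.0, 400.0, 1000.0, 1.0

def flow(z):
    """Axial flow a(z) = a0 - Phi(z), assuming uniformly distributed filtration."""
    return a0 - f * z / L

def delay(z):
    """Delay time T(z) = sigma * integral of d(xi)/a(xi), with sigma = V/L."""
    return (V / f) * math.log(a0 / flow(z))

def decay(z):
    """Factor exp(-PS*T(z)/V) appearing in the steady-state profile."""
    return math.exp(-PS * delay(z) / V)

integral = simpson(decay, 0.0, L)
# Bracket before integration by parts: (PS/L)*I + (S/L)*phi*I - f, with (S/L)*phi = f/L.
bracket = (PS / L) * integral + (f / L) * integral - f
t_c = delay(L)
D_f = (a0 - f) * (1.0 - math.exp(-PS * t_c / V))
print(bracket, D_f)
```

Under the uniform-filtration assumption the two quantities agree to within the quadrature error, which supports the integration-by-parts step for this particular filtration profile.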

The Time Course of Ultrafiltration Hemodialysis. Substituting
the value of U, obtained from (11), into (18), and noting that
conservation of volume for the entire system implies that B(0) + b(0) =
B(t) + b(t) we have

$$\frac{A}{A_0} = \frac{B_0}{B_0 + b_0}\,\exp\!\left[-(B_0 + b_0)\int_0^t \frac{D_f}{B\,b}\,dt\right] + \frac{b_0}{B_0 + b_0} \tag{20}$$

where we have substituted B_0 for B(0), b_0 for b(0) and A_0 for A(0),
in order to maintain our notation consistent with that of Wolf et al.
(loc. cit.).

If we consider the usual case, f and D_f are time independent,
and B and b take on the linear forms

$$B = B_0 + ft, \qquad b = b_0 - ft \tag{21}$$

Inserting (21) into (20) yields

$$\frac{A}{A_0} = \frac{B_0}{B_0 + b_0}\left[\frac{b_0\,(B_0 + ft)}{B_0\,(b_0 - ft)}\right]^{-D_f/f} + \frac{b_0}{B_0 + b_0} \tag{22}$$

Equation (22) corresponds to (3) for the case f = 0, but in this
form its relation to (3) is obscure. This can be remedied by taking
advantage of the fact that both B_0 and b_0 are much greater than ft,
the total volume filtered at time t. Consequently we can expect
rapid convergence of the following Taylor expansion,

$$\frac{1}{Bb} = \frac{1}{(B_0 + ft)(b_0 - ft)} = \frac{1}{B_0 b_0}\,\frac{1}{B_0 + b_0}\sum_{M=0}^{\infty}\frac{B_0^{M+1} + (-1)^M b_0^{M+1}}{B_0^M b_0^M}\,(ft)^M. \tag{23}$$


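Inserting (23) into (20), we have

$$\frac{A}{A_0} = \frac{B_0}{B_0 + b_0}\,\exp\!\left[-\frac{(B_0 + b_0)\,D_f\,K\,t}{B_0 b_0}\right] + \frac{b_0}{B_0 + b_0} \tag{24}$$

where

$$K = \frac{1}{B_0 + b_0}\sum_{M=0}^{\infty}\frac{B_0^{M+1} + (-1)^M b_0^{M+1}}{(M+1)\,B_0^M b_0^M}\,(ft)^M = 1 + \frac{B_0 - b_0}{2 B_0 b_0}\,ft + \frac{B_0^2 - B_0 b_0 + b_0^2}{3 B_0^2 b_0^2}\,(ft)^2 + \dots \tag{25}$$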


Except for the correction K in the exponent, expression (24) has 
the same form as (3). When f=0, K =1. 
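A short sketch of the correction factor K may help here. It is an illustration rather than part of the original derivation. It sums the series K = (B_0 + b_0)^-1 Σ [B_0^(M+1) + (-1)^M b_0^(M+1)] (ft)^M / [(M+1)(B_0 b_0)^M] and compares it with the closed form obtained by integrating 1/(Bb) directly with B = B_0 + ft and b = b_0 - ft, namely K = B_0 b_0 ln[b_0(B_0 + ft)/(B_0(b_0 - ft))] / [ft(B_0 + b_0)]. The function names and the number of terms are assumptions.

```python
import math

def K_series(ft, B0, b0, terms=20):
    """Partial sum of the power series for the correction factor K(ft)."""
    total = 0.0
    for M in range(terms):
        num = B0 ** (M + 1) + (-1) ** M * b0 ** (M + 1)
        total += num / ((M + 1) * (B0 * b0) ** M) * ft ** M
    return total / (B0 + b0)

def K_closed(ft, B0, b0):
    """Closed form from integrating 1/(B*b) with B = B0 + ft, b = b0 - ft."""
    if ft == 0.0:
        return 1.0
    log_term = math.log(b0 * (B0 + ft) / (B0 * (b0 - ft)))
    return B0 * b0 / (ft * (B0 + b0)) * log_term

# Illustrative figures from the paper's example: ft = 10 cc/min * 360 min.
B0, b0, ft = 1.0e5, 4.2e4, 3600.0
k_sum = K_series(ft, B0, b0)
k_log = K_closed(ft, B0, b0)
print(k_sum, k_log)
```

Because ft is small compared with both B_0 and b_0, the series converges quickly; with the figures above the first two terms already give a value within about 0.2% of the closed form.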

Up to this point we have made no assumptions about the nature
of φ(z) other than it be time independent. According to the above
analysis, all that is needed is the net filtration rate f and some
estimate of the circulation time t_c. These can be determined
experimentally, or if the details of the filtration are known, f can be
calculated from (15), and t_c from

$$t_c = \sigma \int_0^L \frac{dz}{a_0 - \Phi(z)} \tag{26}$$

Since Φ(z) is monotonic, t_c would be bounded by V/a_0 and V/a_L. Thus
if f were small the values of a_0 and a_L would lie close together
and an approximate estimate of t_c would be given by

$$t_c \approx \frac{V}{2}\left(\frac{1}{a_0} + \frac{1}{a_L}\right) = \frac{V}{2}\left(\frac{1}{a_0} + \frac{1}{a_0 - f}\right)$$


If body water volume as in a 70 kg man is employed in the
formulas developed in this paper, it will be seen that the
modifications introduced by the filtration process are small. For example
if B_0 = 10^5 cc., b_0 = 4.2 × 10^4 cc., t = 360 min., f = 10 cc./min.,
a_0 = 300 cc./min., PS = 400 cc./min., and V = 1000 cc., we have
D_f = 215 cc./min.; as compared with the corresponding filtration
free case D = 221 cc./min. Again, using these figures the
"correction factor" K(ft) of equation (24) differs from unity by only
about 3%, and the estimated value of A/A_0 differs from the
filtration free A/A_0 by even less.
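The figures quoted above can be reproduced with a few lines of arithmetic. The sketch below is an illustration under stated assumptions: the circulation time is taken as the mean of V/a_0 and V/(a_0 - f), and the filtration case is evaluated with the closed form (22), A/A_0 = B_0/(B_0 + b_0) [b_0(B_0 + ft)/(B_0(b_0 - ft))]^(-D_f/f) + b_0/(B_0 + b_0). The function names are hypothetical.

```python
import math

B0, b0 = 1.0e5, 4.2e4            # bath and body volumes, cc
t, f = 360.0, 10.0               # duration (min) and filtration rate (cc/min)
a0, PS, V = 300.0, 400.0, 1000.0

# Filtration-free dialysance, expression (4).
D = a0 * (1.0 - math.exp(-PS / a0))

# Circulation time estimated as the mean of V/a0 and V/(a0 - f).
t_c = 0.5 * V * (1.0 / a0 + 1.0 / (a0 - f))
D_f = (a0 - f) * (1.0 - math.exp(-PS * t_c / V))    # expression (19)

def ratio_no_filtration(time):
    """Expression (3), constant volumes."""
    s = B0 + b0
    return B0 / s * math.exp(-s * D * time / (B0 * b0)) + b0 / s

def ratio_filtration(time):
    """Expression (22), with B = B0 + f*time and b = b0 - f*time."""
    s = B0 + b0
    base = b0 * (B0 + f * time) / (B0 * (b0 - f * time))
    return B0 / s * base ** (-D_f / f) + b0 / s

print(round(D), round(D_f))
print(ratio_no_filtration(t), ratio_filtration(t))
```

Rounded to the nearest cc./min. the two rate constants come out as 221 and 215, and the two values of A/A_0 at 360 min. differ by well under one percent, consistent with the statement in the text.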

Summary of Assumptions. The above analysis is based on the
assumptions of: 1) uniform composition within any tubing cross
section; 2) negligible axial diffusion; 3) perfect mixing within the
body and bath; 4) uniform tubing shape throughout its length; 5)
negligible solute interaction; 6) φδ/W << 1; and 7) a quasi-steady state.


This work was aided by grant number H-4740 from The National 
Institutes of Health. 


LITERATURE 


Black, M. W. 1959. "Clinical Guides for the Estimation of Water and
Urea Removal during Hemodialysis with the Kolff Twin Coil Kidney."
Tr. Am. Soc. Artificial Internal Organs, 5, 50-57.

Maher, J. F., G. E. Schreiner, and J. Marc-Aurele. 1959. "Methodologic
Problems Associated with in Vitro Measurements of Dialysance." Tr.
Am. Soc. Artificial Internal Organs, 5, 120-28.

Renkin, E. M. 1956. "The Relation between Dialysance, Membrane
Area, Permeability and Blood Flow in the Artificial Kidney." Tr. Am.
Soc. Artificial Internal Organs, 2, 102-105.

Wolf, A. V., D. G. Remp, J. E. Kiley, and G. D. Currie. 1951.
"Artificial Kidney Function: Kinetics of Hemodialysis." J. Clin.
Investigation, 30, 1062-1070.


RECEIVED 12-31-59 


A QUANTUM-THEORETIC APPROACH TO
GENETIC PROBLEMS


ROBERT ROSEN 
COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


A quantum-theoretic picture of the transfer of genetic information is 
described. The advantage of such an approach is that a number of 
genetic effects appear to be explicable on the basis of general micro- 
physical laws, independent of any specific model (such as DNA-protein 
coding) for the transmission of genetic information. It is assumed that 
the genetic information is carried by a family of numerical observables 
belonging to a specific microphysical system; it is shown that a single 
observable is theoretically sufficient to carry this information. The 
various types of structure that this observable can possess are then de- 
scribed in detail, and the possible genetic effects which can arise from 
each such structure are discussed. For example, it is shown how the as- 
sumption that the genetic observable possesses degenerate eigenvalues 
may lead to a theory of allelism. To keep the treatment self-contained, 
the basic quantum-theoretical principles to be used are discussed in 
some detail. Finally, the relation of the present approach to current 
biochemical ideas and to earlier quantum-theoretic treatments of genetic 
systems is discussed. 


I. Introduction: The Quantum-Theoretic Description of Micro- 
physical Systems. In this section we shall set down the basic 
notions of the quantum-theoretic formalism which will be used in 
the sequel. Most of this material is basically well known, but for 
the sake of completeness, and because our emphasis will differ 
somewhat from some other treatments, we have thought it best to 
include this brief discussion. In this section, we shall assume a 
certain familiarity with the terminology of the theory of self-adjoint 
operators on Hilbert space, and its physical significance (see in 
particular von Neumann, 1955). Since the purpose of this section 
is primarily to motivate the development which follows, the more 
technical portions of our discussion can probably be omitted by 
readers unfamiliar with this material. 




Far from being an "abstract" theory, modern quantum physics is
solidly rooted in experiment, and is in fact almost entirely an
"operational" theory; all of its definitions and manipulations are
based on explicit procedures which may (at least theoretically) be
carried out on real physical systems. The basic notions of quantum
theory are founded on the notions implicit in the process of the
"measurement" of numerical attributes of such systems. It is clear
that the most useful type of measurement, for scientific purposes,
is one which leaves unchanged the value of that attribute which is
being measured, so that an immediate repetition of the observing
process will yield the same value as the original measurement. A
measurement of this type is called an observation.

The simplest kind of observation which can be performed on a
microphysical structure is the determination that a particular
numerical attribute of the structure (such as its energy) has a value
falling between two particular real numbers. Thus, if we let P
represent the operation of determining that a particular numerical
attribute of a given structure assumes a numerical value ξ lying
between the numbers a and b (i.e. a < ξ < b) then P will be an
observation if, and only if, P² = P (where P² denotes the operation
of carrying out the process P twice in succession). If P'
represents the observation which determines whether the attribute in
question assumes a value lying between the numbers a', b', where
the intervals (a, b) and (a', b') are disjoint, then the obvious fact
that no attribute of a given physical structure can simultaneously
assume a value lying in two disjoint intervals leads immediately
to the algebraic property P'P = PP' = 0, where 0 corresponds to
the "empty" observation. Finally, if we decompose the entire real
line into mutually disjoint intervals (a_i, b_i), then the corresponding
observations P_i have the property that

$$\sum_{i=-\infty}^{\infty} P_i = 1,$$

where 1 symbolizes the fact that the value assumed by the numerical
attribute in question must fall within some one of the intervals (a_i, b_i).
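The three algebraic properties just stated (P² = P, PP' = P'P = 0 for disjoint intervals, and the observations summing to the identity) can be illustrated in a finite-dimensional setting. The sketch below is an assumption-laden toy: a four-dimensional space with diagonal projections stands in for the general case, and the helper names are hypothetical.

```python
def matmul(X, Y):
    """Product of two square matrices given as nested lists."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def projector(indices, n):
    """Diagonal projection onto the basis vectors listed in `indices`."""
    return [[1 if (i == j and i in indices) else 0 for j in range(n)]
            for i in range(n)]

n = 4
P1 = projector({0, 1}, n)   # attribute value found in a first interval
P2 = projector({2}, n)      # attribute value found in a second, disjoint interval
P3 = projector({3}, n)      # attribute value found in a third, disjoint interval

zero = [[0] * n for _ in range(n)]
identity = projector(set(range(n)), n)
total = [[P1[i][j] + P2[i][j] + P3[i][j] for j in range(n)] for i in range(n)]

print(matmul(P1, P1) == P1)      # repeating the observation changes nothing
print(matmul(P1, P2) == zero)    # disjoint intervals exclude one another
print(total == identity)         # the intervals exhaust the real line
```

Each printed comparison corresponds to one of the three properties in the paragraph above; the diagonal form is chosen only to keep the arithmetic transparent.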

It is well known that these symbolic operations P_i can be
represented by orthogonal projections on a suitable Hilbert space ℌ,
and that the totality of projections which can correspond in the
above manner to a particular numerical attribute comprises the
spectral resolution of a self-adjoint linear transformation on ℌ.
Conversely, any such linear transformation possesses a spectral
family of projections which can (at least in principle) arise through
the observations of some numerical attribute of the system. Hence,
such a transformation on ℌ is called an observable.

The representation of an observable by a self-adjoint linear
transformation on a suitable Hilbert space is a crucial notion in
the abstract quantum-theoretic description of microphysical events.
From general Hilbert space theory, we may note the following
properties of observables: if A is an observable, then so is A²,
A³, ..., or in general, f(A), where f(x) is a real function (see e.g.
Riesz & Nagy 1955 for a precise exposition of the notion of
function of a transformation); if A_1, A_2 are commuting observables,
then every real function f(A_1, A_2) is also an observable. If A_1,
A_2 do not commute, then special precautions are necessary (see
e.g. von Neumann, loc. cit.).

On the basis of the representation described above, we may
attempt to provide an abstract formulation of the notion of a general
microphysical system, in terms of the observables of which it is in
some sense composed. For our purposes, a microphysical system
𝔖 will consist of a collection of observables, subject to the
following conditions:


S1: If A is an observable of 𝔖 (i.e. if A ∈ 𝔖), then every
projection in the spectral resolution of A is in 𝔖 also.

S2: If a family of projections in 𝔖 form the spectral
resolution of some observable, then that observable is in 𝔖.

S3: All functions of the observables in 𝔖, which can be
formed in accordance with the general principles of
quantum theory are observables in 𝔖.


It will be readily noticed from these conditions that 𝔖 is possessed
of a definite algebraic and topological structure, although the
precise description of that structure is not immediately apparent from
these conditions. In fact, the problem of determining a fully
satisfactory structure on 𝔖 has not been solved (see Segal 1947, 1955
for an explicit discussion). Since we shall not require full
mathematical detail throughout most of the present work, the conditions
S1-S3 above will be ample for our purposes.

The physical significance of this definition is as follows: the
quantum-theoretic notion of a microphysical system embodies a sort
of "Platonic Idea" of the system, in the sense that what one
actually observes in nature is not the system itself, but a structure
on which the observables comprising the system assume more or
less definite numerical values. Such a structure is called a state
of the corresponding abstract microphysical system. In the
mathematical representation we have outlined above, the most common
kind of state is represented by a unit vector in the associated
Hilbert space ℌ; the expectation value of the attribute represented
by the observable A in the state ψ is defined as the inner product
(Aψ, ψ). In particular, if ψ is a proper vector of A with proper
value λ, then (Aψ, ψ) = λ exactly. Other states, not represented
by vectors of ℌ, arise if the observable A has a non-empty
continuous spectrum. More generally, the physical significance of the
spectral properties of A is as follows: a numerical measurement
of the attribute represented by A yields a number lying in the
spectrum of A; conversely, each number in the spectrum of A can
be the result of an observation of a suitable state of the system 𝔖;
the various types of possible spectrum have implications related
to theoretical limits of observability. We shall return to these
matters more fully in Section 3 below.
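The expectation value (Aψ, ψ) and its behaviour on proper vectors can be made concrete with a two-dimensional example. The sketch below is illustrative only: the symmetric matrix A and the two unit vectors are hypothetical choices, and real vectors are used so that the inner product needs no complex conjugation.

```python
import math

def apply(A, v):
    """Matrix-vector product for a matrix given as nested lists."""
    return [sum(A[i][j] * v[j] for j in range(len(v))) for i in range(len(A))]

def inner(u, v):
    """Real inner product (u, v); real vectors suffice for this sketch."""
    return sum(x * y for x, y in zip(u, v))

def expectation(A, psi):
    """Expectation value (A psi, psi) of the observable A in the state psi."""
    return inner(apply(A, psi), psi)

A = [[2.0, 1.0],
     [1.0, 2.0]]                  # symmetric, with proper values 1 and 3
s = 1.0 / math.sqrt(2.0)
proper = [s, s]                   # proper vector belonging to the proper value 3
mixed = [1.0, 0.0]                # unit vector that is not a proper vector

print(expectation(A, proper))     # equals the proper value (up to rounding)
print(expectation(A, mixed))      # a weighted mean of the two proper values
```

For the proper vector the expectation coincides with the proper value, whereas for the other state it falls between the two proper values, reflecting the statistical spread of individual measurements.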

Finally, we make the fundamental quantum-theoretic assertion 
that all the information which can possibly be obtained concerning 
the states of a microphysical system is already embodied in the 
above formalism. That is, the quantum-theoretic formalism is as- 
serted to contain the directions for answering any question that can 
meaningfully be asked concerning the behavior of these states. It 
is the opinion of most physicists that this assertion is correct, at 
least on the atomic and molecular levels; we shall explicitly as- 
sume its truth throughout the following development. 


II. The Genetic System. It is becoming increasingly apparent
that an understanding of the basic phenomena of biology must be 
sought at the molecular level. Hence it seems most natural to at- 
tempt to apply general quantum-theoretic methods to fundamental 
biological problems, in terms of the general framework outlined in 
the preceding section. In what follows, we attempt to carry out 
such a study, directed in particular at certain aspects of the ge- 
netic structure of biological systems. 

Let us for the time being drop all preconceptions about the na- 
ture of the primary genetic material, and retain only the knowledge 
that some portion of the cellular structure contains information 
which must be transmitted to and utilized by other cellular
structures in order that the metabolic activity characteristic of the
organism in question can be carried out. We shall now make the
following explicit assumption about this genetic material:


G1: Every structure in a cell which is capable of carrying
genetic information is a definite state of a certain fixed
microphysical system 𝔖, specified according to the rules
S1-S3 of Section I.


The assumption G1 is the simplest one which we can make regard- 
ing the nature of the genetic material. It can, if necessary, be gen- 
eralized in a number of ways, as we shall see below. 

It is evident that before the primary genetic information may be 
utilized properly, there must take place a suitable observation of 
the structures which bear this information, in order that this infor- 
mation be transmitted properly to wherever in the cell it is required. 
We remark explicitly that although this observation is performed by 
a suitable observing system inside the cell, and has nothing what- 
ever to do with a human observer, the fundamental quantum-theoretic 
laws of observation must still be satisfied. Now since this
information can only be transmitted by observation, it follows from the
assumption of the universal character of the quantum-theoretic
formalism that a genetic observation must consist in determining the
numerical values assumed by a certain family of observables in the
genetic system 𝔖 on the state bearing the information in question.

Let us now investigate what conclusions can be drawn
concerning the nature of the particular family of observables of 𝔖 which is
concerned with the transmission of the genetic information; we shall
denote this family by 𝔊. There are, prima facie, two possibilities:


1. 𝔊 contains at least one pair of noncommuting observables.
2. 𝔊 is a set of mutually commuting observables.


If (1) holds, then general well-known quantum-theoretic principles
imply that a simultaneous measurement of the values assumed by
the observables of 𝔊 on any state of 𝔖 is impossible. In
particular, the observation of any one of the observables of 𝔊 to a high
degree of accuracy will always introduce random uncertainties into
the value of any observable which does not commute with this
observable. In biological terms, (1) would imply that more or less
large fluctuations would occur in the metabolic behavior of all
cells, as an unavoidable result of the fact that successive genetic
observations (i.e. determination of the values assumed by 𝔊) on a
state of 𝔖 will not yield the same result, and hence will provide
the cell with a different set of "metabolic instructions." This
type of phenomenon is not observed. It is of course possible for
the fluctuations so induced to be so small and so rapid, at least on
the average, that they cannot easily be detected; however, it seems
more reasonable to exclude the possibility (1).
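The disturbance produced by noncommuting observables can be seen in the smallest possible example. The sketch below is purely illustrative and is not a model of any genetic system: it uses two hypothetical 2 × 2 observables, Z = diag(1, -1) and X with zero diagonal and unit off-diagonal entries, and computes outcome probabilities as squared inner products of real unit vectors.

```python
import math

def matmul(X, Y):
    """Product of two square matrices given as nested lists."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def probability(outcome, state):
    """Probability of finding `state` in the proper vector `outcome` (real vectors)."""
    amplitude = sum(x * y for x, y in zip(outcome, state))
    return amplitude * amplitude

Z = [[1, 0], [0, -1]]
X = [[0, 1], [1, 0]]
ZX, XZ = matmul(Z, X), matmul(X, Z)
commutator = [[ZX[i][j] - XZ[i][j] for j in range(2)] for i in range(2)]

s = 1.0 / math.sqrt(2.0)
z_up = [1.0, 0.0]          # proper vector of Z with proper value +1
x_plus = [s, s]            # proper vector of X with proper value +1

p_x_plus = probability(x_plus, z_up)        # outcome of X on a definite Z state
p_z_up_after = probability(z_up, x_plus)    # Z re-observed after X gave +1
print(commutator, p_x_plus, p_z_up_after)
```

Starting from a state in which Z has a definite value, an observation of X is a coin toss, and a subsequent observation of Z no longer reproduces the original value with certainty. This is the kind of fluctuation that possibility (1) would impose on successive genetic observations.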

We are thus left with the possibility (2), that 𝔊 consists of a
family of mutually commuting observables. In this situation,
however, we are in a position to apply a theorem due originally to von
Neumann:


Theorem 1: Given a family {T_α} of mutually commuting self-adjoint
linear transformations (i.e. observables) on a separable Hilbert
space, there exists a self-adjoint transformation A and a family
of real functions {f_α} such that for each index α, we have T_α =
f_α(A).

Proof: See e.g. Riesz & Nagy (1955) p. 356 et seq.


The theorem is proved by constructing the spectral resolution of
the desired transformation A in a canonical fashion in terms of the
spectral resolution of the T_α, so that we can consider the
transformation A to be precisely the one so constructed. The conditions
S1-S3 now imply that if for each α we have T_α ∈ 𝔖, then the
transformation A is again an observable of 𝔖.
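A finite-dimensional caricature of Theorem 1 may make the construction easier to picture. The sketch below is a simplification under explicit assumptions: the commuting observables are taken to be already diagonal in a common basis, so each is represented by its list of diagonal entries, and the function name single_observable is hypothetical. The single observable A simply assigns a distinct number to each distinct tuple of joint proper values.

```python
def single_observable(diagonals):
    """Label each distinct joint proper-value tuple with one number.

    `diagonals` holds the diagonal entries of commuting observables that are
    already diagonal in a common basis (a finite-dimensional simplification).
    Returns the diagonal of A and one lookup table f_alpha per observable.
    """
    joint = list(zip(*diagonals))
    labels = {}
    for values in joint:
        labels.setdefault(values, float(len(labels)))
    A = [labels[values] for values in joint]
    functions = [{labels[values]: values[k] for values in joint}
                 for k in range(len(diagonals))]
    return A, functions

T1 = [1.0, 1.0, 2.0]     # hypothetical commuting observable, degenerate value 1
T2 = [5.0, 7.0, 7.0]     # hypothetical commuting observable, degenerate value 7
A, (f1, f2) = single_observable([T1, T2])

print(A)
print([f1[a] for a in A] == T1, [f2[a] for a in A] == T2)
```

Here neither T1 nor T2 alone separates the three basis states, but A does, and each T is recovered as a function of A. This mirrors, in a toy setting, the statement that a single observable can carry all the information held by the commuting family.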

The power of Theorem 1 in the present circumstances is the
following: each observable T ∈ 𝔊 can be regarded as a function of a
single observable A. Thus, the measurement of each T ∈ 𝔊 may be
considered as being specified by a measurement of A. In other
words, all the information which can be carried by a set 𝔊 of
mutually commuting observables can already be carried, at least
theoretically, by a single observable A, defined as above. Using this
result as motivation, we now make our second explicit postulate
concerning the genetic material:


G2: The genetic information carried by a state ψ of the
genetic system 𝔖 is completely determined by the value
assumed by a single observable A ∈ 𝔖 on ψ.


In the light of (G2), we might generalize (G1) in such a manner as 
to allow any system of which A is an observable to be capable of 
carrying genetic information. However, this added generality will 
not be needed for our subsequent discussion, and we shall accord- 
ingly retain (G1) in its original form throughout. 


Perhaps it would be well to pause at this point and examine the
relation between our approach, as outlined above, and the
quantum-mechanical treatment of the gene originated by Delbrück and
extended by Schrödinger (1944), as well as the replicative model of
Rashevsky (1959). In each of these approaches, the only observable
which is discussed in detail is the energy observable; in fact,
the assumption seems to be implicitly made that any alteration in
the functioning of the genetic material is a direct result of some
energetic transition on the part of the macromolecule which each
of these treatments supposes to comprise a genetic unit. However,
it seems quite clear that it is not the energy of such a macromolecule
which will uniquely determine its genetic capabilities. There
is of course no doubt that the energy observable is "linked," so to
speak, with the genetic observable A (which does determine the
genetic capabilities of a state of $\mathfrak{G}$), in the sense that a
substantial alteration of the energy of such a state may cause an
alteration in the information itself. However, such an effect is largely
secondary, as far as primary genetic effects are concerned. Moreover,
the shifting of attention away from exclusive consideration of
the energy observable of $\mathfrak{G}$ opens up many possibilities for
genetically significant effects in which the energy is not involved at
all. We can see the inadequacy of solely energetic considerations
from general quantum-theoretic principles: unless A commutes
with the energy observable, measuring the energy of a state of the
genetic system $\mathfrak{G}$ will in general only determine the probability
with which A takes on each of its possible values in that state. For a
fuller discussion of this statistical characteristic of the
observation of microphysical systems, see especially von Neumann (loc.
cit.), Chapter III.

On several counts, therefore, it seems that a study of primary 
genetic mechanisms from a quantum-theoretic point of view requires 
a wider type of treatment than is possible from purely energetic 
considerations. It is hoped that the postulates (G1), (G2) above 
may provide the basis for such a treatment. 

Returning now to the main line of the argument, we have attempted
to show how primary genetic effects may be related to the properties
of a single self-adjoint transformation A. However, we have
a priori no information whatever about the structure of A (i.e. its
spectral properties). Thus, we must proceed by enumerating all
the various possibilities which may obtain, and compare the genetic
consequences of each of these structures with the known
phenomena of genetics. In this way, we may hope to obtain some
information as to the actual structure of A, and at the same time
obtain some insight into how the properties of A can manifest
themselves genetically.


III. Spectral Properties of the Genetic Observable and Their
Biological Implications. The first question which must be answered 
concerning the structure of a given observable A is whether or not 
A is bounded. We enumerate the two possible alternatives as 
follows: 


B1: A is bounded.
B2: A is unbounded. 


A possible immediate corollary of (B2) for genetic theory arises as
a consequence of the well-known theorem of Hellinger and Töplitz:


Theorem 2: A self-adjoint transformation A is defined on every
vector $\psi \in \mathfrak{H}$ only if A is bounded.
Proof: See e.g. Riesz & Nagy (loc. cit.) p. 296. 


Stated another way, Theorem 2 asserts that an unbounded self-adjoint
linear transformation on a Hilbert space $\mathfrak{H}$ cannot be
defined everywhere on $\mathfrak{H}$. In quantum-theoretic terms, this result
implies that there exist states of the genetic system $\mathfrak{G}$ which are not
capable of bearing genetic information (since the unboundedness
of A means precisely that A does not assume a value in these
states). This conclusion bears a close formal analogy to one which
follows easily from results we obtained in previous works (Rosen
1959a, 1959c). In these works, we supposed that genetic information
was embodied in the order of sequences of generators in the
words of a suitable free monoid. In the presence of a rather natural
restriction ("ergodicity"), it is easy to show that not every
word in the monoid can carry genetic information under any given
code. However, we do not wish to dwell on the significance of this
analogy at the present time.

We turn now to a description of the possible spectral properties
of A, and thence to a consideration of the implications of these
properties for genetic phenomena. In what follows, we shall denote
the point spectrum of A by $\sigma_p(A)$ and the continuous spectrum
of A by $\sigma_c(A)$. Since A is by hypothesis self-adjoint, it is known
that the set $\sigma_p(A) \cup \sigma_c(A)$ comprises the spectrum $\sigma(A)$ of A,
and is a set of real numbers.

There are three possibilities which may obtain with regard to the 
spectral structure of A, as follows: 


R1. $\sigma_c(A) = \emptyset$.
R2. $\sigma_p(A) = \emptyset$.
R3. Neither (R1) nor (R2) holds.


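If (R1) holds, then $\sigma(A)$ is a denumerable set (possibly finite) of
real numbers. To each number $\lambda \in \sigma(A)$, there corresponds a closed
subspace $\mathfrak{M}_\lambda$ of $\mathfrak{H}$, consisting of all vectors $\psi \in \mathfrak{H}$ such that
$A\psi = \lambda\psi$. The dimension of this subspace is called the multiplicity of
$\lambda$, and we always have $\bigoplus_{\lambda \in \sigma(A)} \mathfrak{M}_\lambda = \mathfrak{H}$.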

Whatever the spectral structure of A, it is known from general
operator theory that if A is bounded, then $\sigma(A)$ is a compact (i.e.
closed and bounded) subset of the real line. Therefore, it follows
from the Bolzano-Weierstrass Theorem that if $\sigma(A)$ is an infinite
set (with A bounded), there exists at least one point of accumulation
in $\sigma(A)$. On the other hand, if $\sigma(A)$ is a finite set (with A
bounded), then at least one of the eigenvalues of A must be of
infinite multiplicity (otherwise the direct sum of the $\mathfrak{M}_\lambda$ would be
finite dimensional). In fact, it readily follows that the only
situation in which every point of $\sigma(A)$ can be an isolated point with
unit multiplicity is that in which A is unbounded with pure point
spectrum. For if either (R2) or (R3) holds, then $\sigma(A)$ is
non-denumerable whether A is bounded or not, and hence $\sigma(A)$ must
then always possess at least one point of accumulation.
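The contrast between the bounded and unbounded cases can be made concrete with two textbook diagonal operators. These are illustrative assumptions of ours, not operators proposed in the text; $\{e_n\}$ denotes an orthonormal basis of the Hilbert space:

```latex
\begin{align*}
\text{bounded:} \quad & A e_n = \tfrac{1}{n}\, e_n \quad (n = 1, 2, \ldots), &
& \text{eigenvalues } \tfrac{1}{n} \to 0, \text{ so } 0 \in \sigma(A)
  \text{ is a point of accumulation;} \\
\text{unbounded:} \quad & B e_n = n\, e_n \quad (n = 1, 2, \ldots), &
& \text{every eigenvalue } n \text{ is isolated and of unit multiplicity.}
\end{align*}
```

In the first case the eigenvalues crowd together near 0, so that a finite resolving power must eventually fail there; in the second, neighboring eigenvalues remain a fixed distance apart.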

We must now discuss the manner in which these spectral proper- 
ties of A can manifest themselves biologically. 

To begin with, let us observe that quantum physics, as such, is
generally concerned with making statements of an existential nature
in terms of the spectral properties of observables. That is,
in quantum physics, we predominantly find statements of the
following form: given a number $\lambda_0$ in the point spectrum of an
observable, there exists an observing apparatus which can separate,
from an ensemble of states of a system containing the given
observable, all and only those states in which the given observable
assumes the value $\lambda_0$; or, given a number $\mu_0$ in the continuous
spectrum of an observable, there exists an apparatus which can
perform this separation to any prescribed degree of accuracy.


Statements such as these are, however, not of immediate interest 
in biology, for biological systems come already equipped with a 
definite apparatus for observing the states of the system which 
bears the genetic information. What we need to study, therefore, 
is not the theoretical optimum for such measurements, but the rela- 
tion between the properties of particular measuring instruments and 
the spectrum of the observable which the instrument is designed 
to measure. 

Thus, let there be given a definite physical apparatus, which
has been designed to measure the values assumed by some observable
in specific states of a given system. The feature of this
apparatus which is of importance for us is the fact that, inherent
in the construction of the apparatus, we find a definite limit in the
accuracy with which states can be separated when the closeness
of their eigenvalues exceeds a certain limit. This limit is usually
referred to as the resolving power of the apparatus in question. In
purely classical situations, the resolving power of a macroscopic
measuring instrument (such as a lens or prism) can be defined in a
precise manner, as follows: to each apparatus we can associate a
number $\epsilon > 0$, such that whenever the difference between the
values assumed by some observable in two states of an observed
system is greater than $\epsilon$, then the two states can always be
distinguished by the apparatus; if this difference is less than $\epsilon$, then
the two states will never be distinguished by the apparatus. In
quantum-theoretical systems, however, we must replace the notion
of a precisely defined resolving power by essentially probabilistic
ideas: given any two states of a quantum mechanical system under
observation by a particular observing apparatus, there exists a
definite probability that the apparatus will distinguish between
them. If the eigenvalues corresponding to the two states in question
are sufficiently remote, then this probability will be close to
unity; if they are sufficiently proximate, the probability of
resolution will be close to zero.
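The difference between the classical and the probabilistic notions of resolving power can be sketched numerically. The text specifies no functional form for the probability of resolution; the Gaussian-type expression below, the `width` parameter, and both function names are assumptions introduced only for illustration.

```python
import math


def classical_resolves(lam1: float, lam2: float, eps: float) -> bool:
    """Classical resolving power: states are always separated when their
    eigenvalues differ by more than eps, and never otherwise."""
    return abs(lam1 - lam2) > eps


def resolution_probability(lam1: float, lam2: float, width: float) -> float:
    """Hypothetical probability that an apparatus distinguishes two states.

    ASSUMPTION: a Gaussian-type law, 1 - exp(-(d / width)^2), where d is the
    eigenvalue separation. It is 0 for coincident eigenvalues and tends to 1
    for remote ones, which is all that the text requires.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    separation = abs(lam1 - lam2)
    return 1.0 - math.exp(-((separation / width) ** 2))


if __name__ == "__main__":
    for lam2 in (1.0, 1.01, 1.1, 1.5):
        print(lam2, classical_resolves(1.0, lam2, 0.1),
              resolution_probability(1.0, lam2, 0.1))
```

Any monotone function rising from 0 to 1 with the separation would serve equally well; the point is only that the sharp threshold $\epsilon$ is replaced by a smooth probability.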

We may now attempt to apply these ideas to the genetic observable
A, with respect to the definite observing apparatus with which
biological systems determine the values assumed by A on the various
states of the genetic system $\mathfrak{G}$. We may safely assume that
the resolving power (in the quantum-theoretic sense) of this
apparatus is high, but it is nevertheless finite. In more precise
terms, for each point $\lambda_0 \in \sigma(A)$, there exists a neighborhood $N(\lambda_0)$
of $\lambda_0$ such that for any number $\lambda$ in $\sigma(A) \cap N(\lambda_0)$ the probability
that the apparatus will fail to distinguish between $\lambda$ and $\lambda_0$ is
appreciable (this probability in fact increases monotonically to unity
as the radius of $N(\lambda_0)$ approaches zero). Thus, we conclude that
no matter how high the resolving power of the apparatus in question,
it must nevertheless be exceeded in any neighborhood of a
point of accumulation of $\sigma(A)$. As we have seen above, the only
situation in which it is possible for every genetic observation to
be completely precise (up to a negligible probability of error) is
when the genetic observable A is unbounded with pure point spectrum,
and satisfies the further condition that no two eigenvalues
of A lie so close together that there is an appreciable probability
that the observing apparatus will fail to distinguish between the
corresponding states.

We have thus seen that if $\sigma(A)$ possesses points of accumulation,
then no matter how high the resolving power of the genetic
observing system, it may nevertheless happen that some cell
contains two states of $\mathfrak{G}$ the eigenvalues of which lie so close
together that the probability of these states being separated by the
apparatus is significantly less than unity. Let us now attempt to
describe the possible behavior of biological systems in which this
situation obtains. For simplicity, we envision a large number of
identical such systems, each of which contains two states $\psi$, $\psi'$ of
$\mathfrak{G}$, such that the corresponding eigenvalues $\lambda$, $\lambda'$ lie so close
together that the probability $p$ of their being distinguished by the
cell's observing system is much less than 1. There is no a priori
reason to suppose that the phenotypes (i.e. the metabolic
implementations of the genetic information carried by $\psi$, $\psi'$) are related;
let us therefore assume that the phenotype corresponding to $\psi'$ is
lethal (i.e. results in a destruction of the metabolism of a system
bearing it). We remark explicitly that since each member of the
ensemble of biological systems under consideration is assumed
initially viable (i.e. is expressing the phenotype corresponding to
$\psi$), the probability $p$ defined above becomes in this case the
probability that a particular member of the ensemble will be capable of
distinguishing $\psi'$ from $\psi$.

The observed behavior of such an ensemble will be very simple.
Most of the systems will remain viable (i.e. exhibit the $\psi$
phenotype); occasionally a system will die. In fact, if there are $N$
systems in the ensemble at a time $t$, and $N$ is a sufficiently large
number, then $pN$ of the systems can be expected to exhibit the
lethal phenotype at time $t$.
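The expectation $pN$ can be checked with a small simulation. This is a minimal sketch under the assumption that each system independently expresses the lethal phenotype with probability p; the function names and the numerical values are illustrative only and are not taken from the text.

```python
import random


def expected_lethal(n_systems: int, p: float) -> float:
    """Expected number of systems showing the lethal phenotype: p * N."""
    return p * n_systems


def simulate_lethal(n_systems: int, p: float, seed: int = 0) -> int:
    """Count lethal outcomes among n_systems independent trials.

    ASSUMPTION: the systems behave independently, each resolving the lethal
    state with the same probability p.
    """
    rng = random.Random(seed)
    return sum(1 for _ in range(n_systems) if rng.random() < p)


if __name__ == "__main__":
    n, p = 10_000, 0.01
    print("expected:", expected_lethal(n, p))
    print("simulated:", simulate_lethal(n, p))
```

For large N the simulated count fluctuates around $pN$ with a spread of order $\sqrt{Np(1-p)}$, which is why the text requires N to be sufficiently large.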

An example of this type of behavior seems to be provided by
various lysogenic strains of bacteria. In these bacteria, there is
apparently present in the genome a character called prophage, which
becomes activated on a purely statistical basis, and alters the
phenotype of bacteria in such a manner as to cause them to produce
particles of active bacteriophage, and hence to cause their
own lysis and death. If we suppose that the "prophage" locus
corresponds to a state $\psi'$ satisfying the conditions described
above, then we obtain a situation which seems to account for the
experimental facts of lysogeny quite well. In particular, the two
fundamental characteristics of lysogenic bacteria (Jacob and Wollman
1957) are immediate consequences of this situation. These
characteristics are:


(a) The ability of every cell of a lysogenic population to produce 
virulent bacteriophage. 

(b) The complete immunity of bacteria of a lysogenic population 
to superinfection by virulent phage homologous to those aris- 
ing by activation of the prophage locus. 


That (a) follows from the above discussion is evident. The property
(b) can be accounted for in a number of different ways, depending
on the role that phage DNA is assumed to play as a genetic agent
(see Section 4 below). As one possibility, we may assume that
the infective phage DNA actually contains a state $\psi'$ of the genetic
system $\mathfrak{G}$. We further assume that the bacteria of a lysogenic
strain contain, as part of their genome, a state $\psi$ of $\mathfrak{G}$ which lies
very close to $\psi'$, and which is not possessed by non-lysogenic
bacteria. Therefore, a non-lysogenic strain will immediately (with
a high probability) be able to recognize the state $\psi'$ and carry out
its metabolic instructions (i.e. lyse), while the bacteria of a
lysogenic strain will be able to distinguish the state $\psi'$ from the state
$\psi$, which is already present in their genome, only with a small
probability. It will be observed that this picture is in good qualitative
agreement with the actual behavior of different bacterial strains
when exposed to bacteriophage. Moreover, the probability that a
lysogenic bacterium will distinguish between the states $\psi$ and $\psi'$
is not affected by the number of copies of $\psi$ which are present,
but depends only on the distance between the corresponding eigenvalues.
Therefore, the superinfection of lysogenic strains, which on the
basis of the above picture means precisely the incorporation of
further copies of $\psi'$ into the strain, will not increase the frequency
of lysis in such strains beyond the "spontaneous" rate.

We may remark that it is by no means necessary to regard the 
phage DNA as actually corresponding to a state of the genetic ob- 
servable, as was done above for the sake of illustration. We shall 
return to the question of the possible roles of DNA in genetic sys- 
tems in Section 4. The primary purpose of the preceding discus- 
sion was to show how relatively simple arguments, based on a 
general quantum-theoretic picture of the genetic apparatus of bio- 
logical systems, are sufficient to provide us with a model on the 
basis of which we may quantitatively discuss (correctly or not) 
such a fundamental phenomenon as lysogeny. In fact, it will be 
noticed that the above picture (correct or not) is in some ways 
less ad hoc than the more customary explanation for lysogenic 
strains; in the more usual treatment, prophage is regarded as a 
gene, the phenotype of which is manifested in the resistance of 
bacteria to homologous phage (thereby accounting for property 
(b) above), and which mutates periodically to give rise to virulent 
phage (property (a)). However, it is not our present purpose to go 
deeply into the theory of lysogeny, but merely to indicate how the 
general quantum-theoretic considerations outlined above can lead 
to immediate biologically testable conclusions. 

Thus far, we have only considered the manner in which points
of accumulation of $\sigma(A)$, coupled with the finite resolving power
of the observing apparatus, can lead to biologically interesting
situations. We now turn to a discussion of the implications of the
existence of multiple eigenvalues in $\sigma(A)$. In this case, it will be
remembered, there may exist many distinct (i.e. "orthogonal")
states corresponding to the same eigenvalue of A, the number of
such states being the multiplicity of the eigenvalue. In quantum
physics, an eigenvalue with multiplicity greater than one is called
degenerate.

The situation in the case of a degenerate eigenvalue is quite
different from that which obtains in the neighborhood of a point of
accumulation of $\sigma(A)$. In the former case, the several states
corresponding to a degenerate eigenvalue can never be separated by an
observation (always with reference to the single observable under
consideration, of course), however precise this observation may
be; in the latter case, there always exists a finite probability that
closely neighboring states can be separated, the probability
increasing with increasing resolving power of the observing apparatus.

However, although degenerate states (the states corresponding 
to degenerate eigenvalues) cannot be separated by observations 
under ordinary circumstances, it is possible for them to manifest 
themselves by interaction with other systems. This type of situa- 
tion, in which an observed system is not to be considered in isola- 
tion, but in interaction with its environment, is usually referred to 
as a perturbation of the system. We must therefore study the man- 
ner in which these perturbations may give rise to biologically sig- 
nificant effects. Before we undertake this study, we shall briefly 
describe those aspects of general perturbation theory which we 
will require. 

Speaking roughly, a system $\Sigma$ which is being perturbed can be
regarded, from the formal standpoint, as equivalent to a system $\Sigma'$
in isolation, where the observables which define $\Sigma'$ differ from
those of $\Sigma$ by a "small" amount; this is the basic idea of
perturbation theory. In greater detail, let $\Sigma$ denote a definite unperturbed
system, and let $A_1, A_2, \ldots$ be the observables which generate $\Sigma$
(in a suitable algebraic sense). We shall denote the perturbed
system by $\Sigma'$, and we shall assume that $\Sigma'$ is generated by the
observables $A_1', A_2', \ldots$. The relation between $A_i$ and $A_i'$, for each $i$,
is assumed to be given by


$$A_i' = A_i + \epsilon A_i^{(1)} + \epsilon^2 A_i^{(2)} + \cdots, \tag{1}$$


where $\epsilon$ is a (small) parameter, and the $A_i^{(k)}$ are suitable self-adjoint
transformations, depending on the perturbation in question. In
quantum physics, the first two terms of (1) generally suffice.

We now inquire what is the relation between the states of $\Sigma$ and
$\Sigma'$ respectively. In the present situation, we are interested only in
the effect of the perturbation on a single observable (the genetic
observable), which we shall assume to be given by (1);
consequently, we restrict our attention to eigenstates of that observable.
A full analytic answer to the question of how these states transform
under perturbation is long and complicated, and depends
ultimately on the properties of the transformations $A_i^{(k)}$. Therefore, we
shall content ourselves with a qualitative description of the
pertinent facts. For a fuller treatment, we refer the reader to the
literature; Riesz and Nagy (loc. cit.) contain a useful discussion of
perturbation theory. However, the treatment of this theory to be found
in most textbooks dealing with quantum mechanics is very
specialized.

The states of $\Sigma'$ will in general be different from those of $\Sigma$;
however, if the perturbation is sufficiently small, they will not be
too much different. Therefore, the eigenvalues corresponding to
the perturbed and unperturbed states will also not differ too much
from each other in this situation. What is of particular interest for
us is the behavior of the degenerate states under perturbation. On
the basis of what has already been said, it can be seen intuitively
that if $\psi_1$, $\psi_2$ are two states corresponding to a single degenerate
eigenvalue $\lambda$ of A, there is no reason to suppose that the
corresponding states $\psi_1'$, $\psi_2'$ of A' should still correspond to a single
eigenvalue. The precise calculations verify this intuitive picture.
There are many physical examples of this type of effect, such as
the splitting of various lines in atomic spectra under the
application of a magnetic field (Zeeman effect).
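The simplest instance of such a splitting can be written out explicitly. The following two-dimensional example is an illustration of our own, with the perturbing matrix $A^{(1)}$ chosen arbitrarily; it keeps only the first two terms of the expansion (1), restricted to the degenerate subspace spanned by $\psi_1$, $\psi_2$:

```latex
\begin{align*}
A &= \begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix}, \qquad
A^{(1)} = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \\
A' &= A + \epsilon A^{(1)}
    = \begin{pmatrix} \lambda + \epsilon & 0 \\ 0 & \lambda - \epsilon \end{pmatrix}, \\
A' \psi_1 &= (\lambda + \epsilon)\, \psi_1, \qquad
A' \psi_2 = (\lambda - \epsilon)\, \psi_2 .
\end{align*}
```

The single doubly degenerate eigenvalue $\lambda$ is thus replaced by two simple eigenvalues separated by $2\epsilon$; whether an apparatus can tell them apart depends on how $2\epsilon$ compares with its resolving power.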

However, the perturbation of an observable may produce other ef- 
fects besides the splitting of degenerate eigenvalues. These other 
effects depend upon the spectral structure of the observable in 
question, and the nature of the perturbation. Thus, under appropri- 
ate conditions, eigenstates which were initially distinct in the un- 
perturbed system may become degenerate in the perturbed system; 
points of the continuous spectrum can become points of the dis- 
crete spectrum and conversely (see especially Friedrichs 1948 for 
a discussion of the possible effects of perturbing an observable 
with a non-empty continuous spectrum). Physical manifestations 
of most of these effects are known, although simple examples are 
not abundant. 

What are the possible genetic implications of these considerations?
To begin with, let us recognize explicitly that the capacity
of any single biological system (i.e. cell) to utilize genetic
information is very strictly limited. That is, there are many states of
the genetic observable which cannot be "deciphered" by the system
in question, in the sense that such states do not enable the
cell to assume a corresponding phenotype. This limitation has
been proved repeatedly by many different kinds of experiments;
whether it is caused by inherent deficiencies in the observing
apparatus of cells or is due to other effects it is not possible to say.
We merely wish to emphasize here that the capacity of a biological
system to utilize genetic information is sharply and
characteristically circumscribed. This limitation will be of importance for the
following discussion.

Let us begin by considering a state $\psi$ of the genetic system $\mathfrak{G}$,
where $\psi$ corresponds for the moment to a simple eigenvalue $\lambda$ of A
(i.e. we can write $A\psi = \lambda\psi$). If we now perturb the system $\mathfrak{G}$, then
we in effect replace the genetic observable A by a new observable
A', where A' is given by an expression of the form (1). The state
$\psi$ will then be replaced by a state $\psi'$ which is an eigenstate of A',
corresponding to a value $\lambda'$ (i.e. we have $A'\psi' = \lambda'\psi'$). Now $\lambda'$ is
in general different from $\lambda$; an exact expression for $\lambda'$ can be
written down if the perturbation is known (vide Riesz & Nagy, loc.
cit.).

The operational significance of the perturbation is thus as
follows: subsequent to the perturbation, the observing apparatus of
the cell will report the value $\lambda'$ for the state in question. If the
perturbation is very small, so that $\lambda'$ lies so close to $\lambda$ that the
resolving power of the observing apparatus is not exceeded, then
(with high probability) the system will behave as if no perturbation
has taken place. On the other hand, if the perturbation is
sufficiently large so that the observing apparatus can differentiate
between $\lambda$ and $\lambda'$ with an appreciable probability, then a phenotypic
indication of this state of affairs is to be expected.

The actual phenotypic behavior of the system is contingent on
whether or not $\lambda'$ is sufficiently close to another eigenvalue $\mu$ of
the unperturbed genetic operator A. In any case, however, we can
assert that the phenotype corresponding to the eigenvalue $\lambda$ has
disappeared from the system. If $\lambda'$ is so close to $\mu$ that the
observing apparatus will report the perturbed state $\psi'$ as being the
unperturbed state $\phi$ corresponding to the eigenvalue $\mu$, then the
system will assume the phenotypic trait corresponding to $\mu$ if it is
possible for it to do so; if this is not possible (i.e. if the state $\phi$
corresponds to genetic information which the cell in question
cannot utilize), or if $\lambda'$ is not sufficiently close to another eigenvalue
of A, then the only phenotypic manifestation of the perturbation will
be the disappearance of the trait corresponding to $\lambda$ from the
phenotype of the system.
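The case analysis of the last two paragraphs can be summarized as a small decision procedure. This is only a sketch: it replaces the probabilistic resolving power by a sharp tolerance `tol`, and the function name, argument names, and return labels are all hypothetical conveniences rather than anything defined in the text.

```python
def phenotypic_outcome(lam, lam_perturbed, other_eigenvalues, usable, tol):
    """Classify the phenotypic effect of a permanent perturbation.

    lam               -- unperturbed eigenvalue of the state in question
    lam_perturbed     -- eigenvalue reported after the perturbation
    other_eigenvalues -- other eigenvalues mu of the unperturbed observable
    usable            -- set of eigenvalues whose information the cell can use
    tol               -- ASSUMED sharp threshold standing in for the
                         (really probabilistic) resolving power
    """
    # Shift too small to be resolved: the system behaves as if unperturbed.
    if abs(lam_perturbed - lam) <= tol:
        return ("unchanged", lam)
    # The trait of lam is lost; check whether the shifted value is mistaken
    # for a neighbouring eigenvalue mu that the cell is able to utilize.
    for mu in other_eigenvalues:
        if abs(lam_perturbed - mu) <= tol and mu in usable:
            return ("replaced", mu)
    # No usable neighbour: the only effect is the disappearance of the trait.
    return ("lost", None)


if __name__ == "__main__":
    print(phenotypic_outcome(1.0, 1.01, [2.0], {2.0}, 0.05))
    print(phenotypic_outcome(1.0, 1.98, [2.0], {2.0}, 0.05))
    print(phenotypic_outcome(1.0, 1.50, [2.0], {2.0}, 0.05))
```

A fuller model would return probabilities for each outcome instead of a single label, in keeping with the statistical character of the observation.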

Let us notice that we have assumed, throughout the above dis- 
cussion, that the perturbation under consideration is acting through- 
out the entire time in which biological observation can take place. 


An example of such a perturbation might be the following: suppose 
we consider, provisionally, that the genetic information is carried 
by molecules of DNA. If we substitute various analogs for the 
purine and pyrimidine bases occurring in DNA, we may obtain mol- 
ecules which are still capable of carrying genetic information, but 
may not carry the same information as the unsubstituted molecules 
did. The structural alteration arising from such substitutions may 
be considered as inducing a perturbation of the observable A, at 
least with respect to the particular state under consideration; such 
a perturbation will remain in effect as long as the molecule retains 
its integrity; i.e. for practical purposes, such a perturbation acts 
infinitely long. 

On the other hand, if we have a perturbation which acts only
momentarily, so to speak, we get somewhat different results. Let us
denote the perturbed state by $\psi'$, and let us suppose that $\psi'$ arose
through perturbation of the state $\psi$. The general theory of such
momentary perturbations tells us that when the perturbation is
removed, $\psi'$ will revert to a state $\phi$ of the original system. There is
no reason why we should have $\phi = \psi$; in fact, it follows from the
general theory that there is a finite probability (the transition
probability) that the initial state $\psi$ will be transformed to any state of
the unperturbed system as a result of the perturbation (subject of
course to the various conservation rules of quantum mechanics,
such as spin, angular momentum, etc.; transitions which violate
such rules are called forbidden, and do not ordinarily occur in
nature).

Perturbations of this momentary type may perhaps be induced by
radiation, although the gross structural alterations and
cross-linking of polymers induced by high-frequency radiation are almost
certainly of a permanent nature. It is readily seen, nevertheless,
that the effect of a transition induced by a "temporary"
perturbation will correspond to what is referred to as a point mutation. If
point mutations do indeed arise in this manner, then it may be
possible to obtain some information about the various transition
probabilities between states of the system $\mathfrak{G}$. For let us assume, as
genetic experience seems to indicate, that in any particular
biological system the various states of $\mathfrak{G}$ which are present will
occupy definite, more or less determinable positions relative to one
another in the genome. Let $\psi$ correspond to a state of $\mathfrak{G}$ contained
in the genome under consideration, and let us denote its locus by
$\theta$. If $\phi$ is another state in the genome, occupying some other
locus $\theta'$, and if the transition $\psi \to \phi$ is not forbidden, then there
exists a finite probability that a temporary perturbation will induce
the transition $\psi \to \phi$. The effect of this transition would be the
appearance of the genetic characteristic corresponding to $\phi$ at the
locus $\theta$. Effects of this nature should be experimentally
detectable under certain circumstances; the paucity of such effects in
the literature may indicate that the great majority of such
transitions are actually forbidden. It may be remarked that the
forbiddenness of most such transitions may be considered as a safety
device to ensure stability in the metabolic functioning of the genetic
system; therefore, systems bearing only states of $\mathfrak{G}$ between which
transitions are forbidden would perhaps tend to have a selective
advantage over systems with other arrangements.

It should be clear that the genetic effects caused by the
temporary and the permanent type of perturbation will have quite
different characteristics. First, the permanent perturbation should
show little tendency to revert, while the temporary perturbation
may (although it need not) revert with a comparatively high
frequency. Further, the temporary perturbation, after transition, will
replicate perfectly, since it is a bona fide state of the genetic
system $\mathfrak{G}$; the permanent perturbation, on the other hand, may
replicate in an aberrated fashion, since it is no longer a state of $\mathfrak{G}$.
It must be observed that although these distinctions are prima facie
more or less readily observable, we have ignored the many other
important processes which interpose themselves between the
observation of the genetic information and the manifestation of the
corresponding phenotype, and hence our discussion may give only
approximate criteria for their recognition. Nevertheless, Benzer
and Freese (1958) report significant qualitative differences between
the behavior of mutant bacteriophage prepared with DNA analogs
and mutant phage which arise "spontaneously."

We have thus far restricted our attention to the effects of the
various types of perturbations on simple eigenstates of the genetic
observable A. We must now consider the effects of such perturbations
on degenerate eigenstates of A. We recall that, under any
perturbation, the various states corresponding to a degenerate
eigenvalue will in general be separated into distinguishable states.
More exactly, if $\psi_1, \psi_2, \ldots, \psi_n$ are the eigenstates corresponding
to the eigenvalue $\lambda$ of A (where $\lambda$ has multiplicity $n$), then a
perturbation will in general transform these states into the states
$\psi_1', \psi_2', \ldots, \psi_n'$, which correspond to distinct eigenvalues of the
perturbed observable A'. From the purely genetic standpoint, it is
quite clear that any one of the states $\psi_i$ ($i = 1, 2, \ldots, n$) could
equally well occur at the particular locus $\theta$ in the genome which is
observed to correspond to the phenotypic manifestation
characteristic of the value $\lambda$. In fact, we may assume that a naturally
occurring population of biological systems bearing information
corresponding to $\lambda$ will contain a definite distribution of the states
$\psi_i$ at locus $\theta$, in much the same manner as a natural population of
atoms of any chemical element contains a definite distribution of
isotopes which exhibit identical chemical properties.

If we apply a suitable perturbation to such a population (which 
of course will appear genetically homozygous for the character 
corresponding to A), we find that we obtain a number of genotypi- 
cally distinguishable cells, the number of different genotypes de- 
pending on the multiplicity n of X. As far as phenotypic manifesta- 
tion of these different genotypes is concerned, the same remarks 
that we made in connection with the same problem in the case of a 
simple eigenvalue are applicable in the present situation also. 
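A minimal numerical sketch of the splitting described above, assuming a two-dimensional degenerate subspace and a hypothetical symmetric perturbation V. Neither the matrix V nor the function names come from the original; they are chosen only to show a doubly degenerate eigenvalue separating into two distinct eigenvalues as the perturbation strength grows.

```python
import math


def sym2x2_eigenvalues(a, b, c):
    """Eigenvalues of the real symmetric matrix [[a, b], [b, c]], ascending."""
    mean = (a + c) / 2.0
    radius = math.hypot((a - c) / 2.0, b)
    return (mean - radius, mean + radius)


def perturbed_spectrum(lam, eps, v):
    """Spectrum of lam*I + eps*V for a symmetric 2x2 V = ((v11, v12), (v12, v22))."""
    (v11, v12), (_, v22) = v
    return sym2x2_eigenvalues(lam + eps * v11, eps * v12, lam + eps * v22)


lam = 1.0                       # degenerate eigenvalue of multiplicity 2
V = ((0.0, 1.0), (1.0, 0.0))    # hypothetical perturbation (an assumption)

for eps in (0.0, 0.01, 0.1):
    low, high = perturbed_spectrum(lam, eps, V)
    print(f"eps={eps:<5} eigenvalues=({low:.3f}, {high:.3f}) gap={high - low:.3f}")
```

With no perturbation the two states share one eigenvalue and cannot be told apart; any non-zero strength in this toy case produces a gap proportional to the strength.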

What we wish to point out in this connection is that, under cer- 
tain circumstances, the perturbation of a multiple eigenvalue of A 
may give rise to phenomena whose effects bear a close resemblance 
to the production of multiple alleles or pseudoalleles of a gene. 
Let us observe that, by definition, the only distinction between 
the phenomena of pseudoallelism and multiple allelism is the 
empirical observation that crossing-over is observed (albeit rarely) 
between pseudoallelic genes, and has never been observed between 
multiple alleles. Continuing investigation has already shown that 
an ample number of loci which had heretofore been regarded as 
the seat of multiple alleles are actually compound. In fact, it ap- 
pears to be the opinion of some investigators that all cases of 
multiple allelism are actually very closely linked pseudoalleles. 
Such a situation would correspond very well with the picture de- 
scribed above, in which allelism is taken to result from the per- 
turbation of a population bearing a distribution of states corre- 
sponding to the same degenerate eigenvalue of A. Moreover, it 
must be pointed out that the determination of the locus of a given 
gene is itself a microphysical observation, of a specific type. 
There is no reason to exclude the possibility that a perturbation of
sufficient magnitude to alter the genetic information carried by
some state of the observable A will also affect the observable which 
corresponds to the locus of the state. The rules of microphysical 
observation then apply: if the shift of locus which occurs in this 
manner is smaller than the resolving power of the appropriate ob- 
serving apparatus, then the genes in question will be regarded as 
multiple alleles; otherwise they can be resolved by the apparatus, 
and crossing-over may take place between them with a frequency 
proportional to the probability of resolution. This type of picture
lends some support to the view that multiple allelism and
pseudoallelism are basically the same type of phenomenon.

If the production of alleles can indeed occur in the manner described
above, then we immediately obtain as a corollary the important
result that not every gene (i.e. state of $\mathfrak{S}$) can give rise to
a series of alleles. In order for allelism to be possible, it is
necessary that the corresponding eigenvalue of the genetic ob- 
servable A be degenerate; the multiplicity of this eigenvalue is 
then the maximal number of alleles possible in the series. We 
draw the further conclusion that if the above considerations are at 
all correct, then the genetic observable cannot possess a pure 
point spectrum with every eigenvalue simple. This in turn implies 
the possible occurrence of the other effects discussed above. It 
also implies that a coding model for the transmission of genetic 
information cannot exist (at least with regard to models of the 
type customarily assumed in the biochemical literature); we shall 
return to a discussion of this question in Section 4. 

We have shown that a perturbation of $\mathfrak{S}$ can give rise to effects
which closely resemble the production of allelic series. However, 
we mentioned earlier that a perturbation may be responsible for 
other effects besides the splitting of degenerate eigenvalues. 
Some of these effects could, at least theoretically, give rise to 
genetically observable phenomena. 

For example, it was mentioned that a perturbation could under 
certain circumstances cause a point of the continuous spectrum of 
an observable to be transformed into a point of the point-spectrum 
of the perturbed observable, and conversely. Genetically, this 
type of phenomenon might manifest itself in a change in the ap- 
parent ‘‘stability’’ of certain genes, since the transition from con- 
tinuous spectrum to point-spectrum will in general decrease the 
probability of errors arising from the finite resolving power of the
genetic observing apparatus, whereas a transition from point-spectrum
to continuous spectrum will tend to increase this probability.
Many instances are known of genes which show an alteration
in their mutation rate when their environment is changed (e.g.
when they are embedded in an altered genome). It is possible that
these new environments may contain perturbing influences which
are capable of causing the observed alterations in stability according
to the above picture; however, such notions are very speculative.

Moreover, under certain circumstances, a perturbation may cause 
states which originally correspond to distinct eigenvalues of an 
unperturbed observable to correspond to a single degenerate eigen- 
value of the perturbed observable. This is an effect inverse to the 
splitting of a degenerate state. Qualitatively, it is readily seen 
that many different effects of such a perturbation are possible, de- 
pending on the states involved and their phenotypic effects; one 
possible application which comes immediately to mind deals with 
the phenomenon of gene conversion. However, the situation in 
which a phenomenon resembling conversion can occur is of a rather 
specialized nature, and we shall not pursue the detailed discus- 
sion here; we hope to return to these matters in another place. 

Thus, we have seen that the assumption that genetic information 
is carried in biological systems by a single observable of a definite 
quantum-theoretic system leads to a number of predictions which 
are, in principle at least, empirically verifiable. Some of these 
predictions deal with effects whose properties seem formally 
similar to well known genetic phenomena; others do not seem to 
have ever been observed. In the following section, we shall in- 
vestigate some more general conclusions which follow from our 
assumption concerning the structure of the genetic system. 


IV. Some Further Considerations. Let us first investigate the 
manner in which a system $\mathfrak{S}$, capable of carrying genetic information
(i.e. possessing the operator A as one of its observables) can
be constructed out of simpler systems. In other words, we inquire
how the various states of $\mathfrak{S}$, which are (microphysical) objects in
the world, can be synthesized from simpler such objects in accord 
with the laws of physics. An answer to this question would not 
only give insight into the manner in which these states are con- 
structed (and replicated) in actual metabolizing systems, but would
also throw some light on the mechanisms involved in the genesis
of these states through inorganic means; in this manner, questions 
of this nature have a direct bearing on problems related to the ori- 
gin of life. 

A quantum-theoretic treatment of this question would proceed
along the following lines: quite generally, given n systems
$\mathfrak{S}_1, \mathfrak{S}_2, \ldots, \mathfrak{S}_n$, we can construct a composite system
$\mathfrak{S} = \bigotimes_{i=1}^{n} \mathfrak{S}_i$ from the $\mathfrak{S}_i$ in a canonical manner, as follows: let the Hilbert
space associated with the system $\mathfrak{S}_i$ be denoted by $\mathfrak{H}_i$. Then the
Hilbert space associated with $\mathfrak{S}$ is given by the tensor product of
the spaces $\mathfrak{H}_i$, and every observable of $\mathfrak{S}$ can be expressed as a
tensor product of suitable observables of the systems $\mathfrak{S}_i$. We are
here interested in the converse problem: given a system $\mathfrak{S}$, and an
observable $A \in \mathfrak{S}$, how can we write the corresponding Hilbert
space $\mathfrak{H}$ as a tensor product
$\mathfrak{H} = \mathfrak{H}_1 \otimes \mathfrak{H}_2 \otimes \cdots \otimes \mathfrak{H}_n$, where the $\mathfrak{H}_i$
are the Hilbert spaces corresponding to systems $\mathfrak{S}_i$, and such that
the genetic observable $A \in \mathfrak{S}$ assumes the form
$A = A_1 \otimes A_2 \otimes \cdots \otimes A_n$, each $A_i$ being a suitable observable of the system
$\mathfrak{S}_i$?
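The canonical construction can be illustrated by the simplest case of two subsystems. The following is only a sketch under the assumption that both observables have pure point spectra; the symbols $\alpha_j$, $\beta_k$, $\varphi_j$ and $\chi_k$ are introduced here for illustration and do not appear in the original.

```latex
A_1 \varphi_j = \alpha_j \varphi_j, \qquad A_2 \chi_k = \beta_k \chi_k
\;\Longrightarrow\;
(A_1 \otimes A_2)(\varphi_j \otimes \chi_k) = \alpha_j \beta_k \,(\varphi_j \otimes \chi_k).
```

The eigenvalues of the composite observable are thus products of the eigenvalues of the factors. Distinct pairs $(j, k)$ may yield the same product, so a composite observable can possess degenerate eigenvalues even when each factor has only simple ones.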

A solution to questions of this type, and in fact even their precise
formulation, depends upon a careful investigation of the algebraic
and topological structures imposed on the set of observables
which, in the abstract, comprise the system $\mathfrak{S}$. It will be
recalled that we sidestepped this question entirely in Section 1 
above, mainly because a completely satisfactory set of postulates 
for the representation of microphysical systems has not as yet 
been forthcoming (although some proposals have very great utility). 
Further, all the theories which have been proposed up to this time 
involve the use of rather formidable mathematical apparatus (such 
as the theory of C*-algebras or the operator rings of Murray & von 
Neumann). Primarily for the sake of illustration, we shall cite the 
results relevant to the questions proposed above which are obtainable
by regarding a microphysical system $\mathfrak{S}$ to be represented by a
ring of operators. This theory has been chosen because it is the 
one most readily accessible in the literature, to which we refer 
the reader for a fuller discussion of these matters (see especially 
Kadison 1958 for a useful review and bibliography). 

The theory of operator rings shows that, if a system $\mathfrak{S}$ is
expressible as a composite of non-interacting subsystems $\mathfrak{S}_1$, $\mathfrak{S}_2$,
then the corresponding rings of operators $\mathcal{R}_1$, $\mathcal{R}_2$ have the following
properties:

(a) The smallest ring containing $\mathcal{R}_1$ and $\mathcal{R}_2$ is the ring $\mathcal{R}$ of
observables of $\mathfrak{S}$.

(b) Every observable in $\mathcal{R}_1$ commutes with every observable in
$\mathcal{R}_2$, and conversely.

(c) Any observable in the intersection of $\mathcal{R}_1$ and $\mathcal{R}_2$ is a
constant (i.e. a scalar multiple of the identity observable).

Subrings of $\mathcal{R}$ which satisfy these three conditions are called factors;
the decomposition of $\mathcal{R}$ induced by $\mathcal{R}_1$, $\mathcal{R}_2$ is called a factorization
of $\mathcal{R}$. Thus, the physical problem of partitioning a microphysical
system into a family of (non-interacting) subsystems
translates precisely (under the assumption that the theory of op- 
erator rings is adequate for the representation of these systems) 
into the purely mathematical problem of finding factorizations for 
the corresponding rings of operators. 
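Property (b) can be checked directly in a finite-dimensional toy case. The sketch below assumes two small integer matrices as stand-ins for observables of the two subsystems (the matrices and the helper names `kron`, `matmul` and `identity` are illustrative assumptions, not part of the original); it embeds each in the composite system and verifies that the embedded operators commute and that their product is the tensor product.

```python
def kron(a, b):
    """Kronecker (tensor) product of two matrices given as lists of rows."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def matmul(a, b):
    """Ordinary matrix product of two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def identity(n):
    """n-by-n identity matrix."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


# Hypothetical observables of subsystems 1 and 2 (symmetric integer matrices).
A1 = [[1, 2], [2, 0]]
A2 = [[0, 1, 0], [1, 3, 0], [0, 0, 5]]

X = kron(A1, identity(3))   # A1 acting on the composite system
Y = kron(identity(2), A2)   # A2 acting on the composite system

commute = matmul(X, Y) == matmul(Y, X)
product_ok = matmul(X, Y) == kron(A1, A2)
print("embedded observables commute:", commute)
print("their product equals the tensor product:", product_ok)
```

Integer entries are used so that the equality checks are exact. The sketch illustrates only condition (b); conditions (a) and (c) concern the rings as a whole and are not tested here.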

One of the main difficulties of this abstract approach, as far as 
practical results are concerned, is the non-existence of criteria for 
determining which factors are actually capable of serving as repre- 
sentations for real (i.e. possible) microphysical systems. This 
problem seems to be completely ignored in the literature. Another 
form of the same problem arises in the representation itself, in the 
form of the following pseudo-paradox: the states of a microphysical 
system are always taken to form a separable Hilbert space. All 
such spaces are isomorphic; hence all systems should be abstractly 
identical. The key to these statements may perhaps lie in the fact 
that these Hilbert spaces, at least insofar as they arise in quantum 
physics, seem to be possessed of a multiplicative structure in 
addition to their structure as Hilbert spaces; this multiplicative 
structure arises from the pointwise multiplication which always 
exists in function-spaces (the spaces which have arisen in quantum 
physics being spaces of complex-valued functions). Different such 
spaces may well possess different multiplicative structures; these 
in general will not be preserved under Hilbert-space isomorphisms, 
and hence may provide a basis for an abstract distinction between 
microphysical systems. These multiplicative structures will also 
impose conditions on the corresponding ring of observables which 
acts on these spaces; these properties may be useful in obtaining 
criteria for those rings which can actually serve to represent definite
microphysical systems.


As the theory stands now, however, it appears that a factoriza- 
tion of an arbitrary ring of operators, in the sense described above, 
is always possible. In physical terms, this implies that any micro- 
physical system can be expressed as a composite of non-interacting 
subsystems. As far as genetic systems are concerned, this result 
is not surprising, since the various states of the genetic systems 
are repeatedly being synthesized from simpler units in biological 
systems. However, there is at the moment no reason to suppose 
that such a factorization is unique; in fact, a ring of operators may 
generally be factored in many distinct ways. This in turn implies 
that it is (at least theoretically) possible for the states of the genetic
system $\mathfrak{S}$ to be synthesized in many different ways from
states corresponding to different families of subsystems of $\mathfrak{S}$. In
biochemical terms, this means that there exist many different pathways
and modes of synthesis of the states of $\mathfrak{S}$. The possibility
then arises that not all organisms utilize the same mode of syn- 
thesis in their replication of these states, and further, that these 
modes may in turn differ from the mode of synthesis whereby these 
states may be constructed out of inorganic materials, without the 
intervention of a biological system. 

On the other hand, if it should be found that the factorization of
$\mathfrak{S}$ is indeed unique, this would imply that there is only one way in
which the state of $\mathfrak{S}$ can be synthesized from the states of simpler
subsystems. Hence the mode of synthesis would have to be the 
same in all organisms, and in fact, would have to be the same mode 
by which these states can be synthesized inorganically. This 
would in turn imply that the mode of synthesis of these states, and 
hence certain fundamental properties of living matter, are essen- 
tially unique. 

As mentioned above, we may effectively discard the possibility 
that no factorization of $\mathfrak{S}$ exists.

It should be remarked that the above argument can only be an 
approximation to the true state of affairs, since we have explicitly 
assumed that the subsystems $\mathfrak{S}_i$ into which the genetic system $\mathfrak{S}$
can be factored are non-interacting. Intuitively, it seems probable 
that such a restriction is far too strong. However, the non-inter- 
acting case is at present the only situation which can satisfactor- 
ily be treated from a formal point of view; the usual procedure in 
quantum physics is to assume that any effects of interaction can 
be treated as perturbations of the non-interacting case, and the
perturbation theory described in the preceding section is then ap-
plied. This technique has yielded satisfactory results in quantum 
physics, so that it may be assumed valid in the present case also. 

We may remark here that further quantum theoretic questions of 
the type mentioned above, though with a somewhat different em- 
phasis, have been discussed by N. Rashevsky (1959). 

Finally, we shall turn to the consideration of two related prob- 
lems, one of which has been already touched on above. They are: 
(a) what is the relation of our quantum-theoretic formalism to cur- 
rent biochemical concepts of the coding of genetic information? 
and (b) what are the most likely biochemical candidates for the role 
of the various states of the genetic system $\mathfrak{S}$? It is readily seen
that an answer to one of these questions may have very great in- 
fluence on the answer to the other. For simplicity, let us first 
consider the problem of coding. 

In previous work, we have considered at some length a class of
codes which are in a sense the most general codes compatible with
current biochemical information (vide Rosen 1959a, 1959c). In that
work, we have shown that there is a countably infinite set of different
units capable of carrying coded genetic information. According
to our present approach, each one of these units must correspond
to a different proper value of the genetic observable A,
and conversely; a fortiori we must have $\sigma_c(A) = \emptyset$. If the coding
is to be 1-1, as is invariably assumed, we must have A unbounded,
with each eigenvalue isolated and of unit multiplicity. The fact
that A must be actually unbounded (so as to be free from points of
accumulation) and infinite (to avoid multiplicities greater than
unity) means that infinitely many different states of $\mathfrak{S}$ must be capable
of existence without violating the laws of physics. Further,
the properties of $\sigma(A)$ just outlined, which are essential to ensure
a 1-1 coding process, imply that none of the genetic effects discussed
in the preceding section, arising because of the presence
of points of accumulation or multiple eigenvalues in $\sigma(A)$, can possibly
occur. Stated another way, the assumption of the existence
of a coding process satisfying the conditions laid down in loc. cit.
is incompatible with the existence of quantum-theoretic effects in
the genetic system. That is, the coding process envisioned in the
current biochemical literature is essentially macroscopic in nature.

On the other hand, if it should be found that the quantum-theoretic 
effects discussed in the preceding section do occur, then we must
conclude that either the coding principle is entirely incorrect, or
else that the code is of a completely different type than has been 
envisioned heretofore. This particular point should be found sus- 
ceptible of a direct experimental test, and is therefore essentially 
decidable. 

The problem of coding is related to the question of the physical 
nature of the states of $\mathfrak{S}$ by the fact that DNA, which has been
strongly implicated in primary genetic processes by a large num- 
ber of experimental results (and is therefore a most natural candi- 
date for these states) can apparently code information in the order 
of the sequence of its component nucleotides. Several investi- 
gators have recently raised doubts as to the cogency of the evi- 
dence implicating DNA in fundamental genetic processes; if they 
are correct, then the question of coding loses much of its signifi- 
cance. It may not be amiss therefore briefly to review the situa- 
tion, in the light of our discussion above. 

There are at least four different types of experimental evidence 
which indicate that DNA is implicated in fundamental genetic 
mechanisms. These are: (a) DNA seems to occur exclusively on 
the chromosomes of most cells, (b) the amount of DNA per chro- 
matid is constant and apparently free of turnover, (c) DNA can ap- 
parently cause phenotypic changes in micro-organisms, and (d) the 
ultraviolet absorption spectrum of DNA corresponds well with the 
curve obtained when the mutagenic effect of the light is plotted 
against its frequency. Chayen (1958) provides a detailed critique 
of the first three of these general results, pointing out that in each 
case there exist organisms in which the property in question is ap- 
parently not satisfied. Chayen even goes so far as to suggest that 
the primary genetic information is in fact not carried by DNA; such 
a suggestion was also proposed by Hughes-Schrader (1953), cited 
in Goldschmidt (1958), as a result of his studies of the chromo- 
somes of closely related species of mantids. At the very least, it 
seems safe to say that there is a small but obstinate body of data 
which cannot easily be reconciled with the notion that DNA is the 
bearer of primary genetic information. 

It thus seems reasonable, in the light of the above, to ask the 
question: is the conclusion that DNA carries the primary genetic 
information the only one which is consistent with the experimental 
data (a)-(d) above? We shall now attempt to make it plausible that
the answer to this question may be ‘‘no.’’ Explicitly, we shall try
to show that these data seem to agree equally well with the hypothesis
that DNA forms a portion of the observing apparatus
whereby the states of $\mathfrak{S}$ are recognized in a biological system.
That is, a cell lacking in a certain kind of DNA may not be able to
observe a corresponding family of states of $\mathfrak{S}$, in much the same
manner that, for example, a radio receiver cannot detect a frequency 
which its tuning circuits are not equipped to receive. The inser- 
tion of an appropriate DNA molecule into the cell may then enable 
such states to be observed, just as the insertion of the proper tun- 
ing circuits will enable a radio receiver to detect any particular 
frequency. In fact, it is difficult to envision a direct experimental 
test which would be capable of distinguishing between these two 
interpretations of the available data. 

Consider, for example, the transformation phenomena in bacteria, 
in which DNA from a donor strain, when brought into contact with 
an appropriate recipient strain, causes a certain proportion of the 
recipient bacteria to assume some distinctive phenotypic property 
of the donor strain, which will then be passed on genetically to all
the descendants of these recipient ‘‘transformed’’ bacteria. The 
customary interpretation of these phenomena, as is well known, is 
that the DNA contains the primary genetic information correspond- 
ing to the phenotypic property induced in the recipient strain. 
However, on the basis of the above comments we might envisage 
the following interpretation: the recipient cells are assumed actu- 
ally to carry the genetic information corresponding to the ‘‘trans- 
formed’’ phenotype, but are unable to observe it, due to some de- 
fect in their observing apparatus. The DNA from the donor strain 
serves to remedy this defect, so that the primary information which 
was previously unusable by the recipient strain becomes capable 
of direct utilization by the cell, which then assumes the
‘‘transformed’’ phenotype.

This line of argument can readily be applied to all other data 
implicating DNA as the primary genetic material; the reader may 
find it interesting to construct the appropriate arguments for him- 
self. We obtain in this manner a picture which seems no more ad 
hoc than many of the more customary interpretations; it has, more- 
over, the virtue that arguments such as those of Chayen and 
Hughes-Schrader cited above lose much of their cogency if DNA is 
in fact not the bearer of primary genetic information. And we may 
still further argue that if two quite distinct pictures exist whereby 
the known data can be interpreted, there may yet be others, which 
have not been considered because of their unfamiliarity. 


The main point of this discussion is the following: one cannot 
as yet speculate on the nature of the states of $\mathfrak{S}$, nor as to the
mechanism by which the genetic observation takes place, with any 
degree of assurance on the basis of the current experimental evi- 
dence. In fact, it seems to be difficult even to conceive of a suit- 
able genetic experiment which can enable us to decide between 
various alternative mechanisms of the type described above. It is, 
of course, quite likely that light can be thrown on these questions 
by continued physico-chemical investigation of cellular macromol- 
ecules and their possible interactions. But even here, the various 
possibilities are so numerous that even such methods as these may 
be inadequate to resolve these problems. It may not be out of 
place to remark in this context that Niels Bohr has suggested that 
there may be classes of biological problems whose experimental 
solution is impossible in principle, just as it is impossible to design
an experiment which will simultaneously determine the position
and momentum of an electron.

For these reasons, we shall not attempt to hazard a guess here 
as to the actual nature of the states of the genetic system $\mathfrak{S}$. Our
primary purpose has in fact been to show that a number of the phe- 
nomena of genetics seem to be understandable from the viewpoint 
of the general theory of microphysical observation, independent of 
any special assumptions as to the mechanisms by which these ob- 
servations are carried out. It is not known whether such general 
considerations as those discussed above may fruitfully be applied 
to other phenomena of genetic interest; in particular, it will be im- 
portant to investigate how these considerations may be applied to 
problems involved in the replication of the states of the genetic 
system. We have attempted to indicate elsewhere (Rosen 1959b, 
1959d) some of the limitations which must be imposed in any dis- 
cussion of a ‘‘replicating’’ system, and to propose a concrete 
framework for discussion of this phenomenon (once again inde- 
pendent of any specific mechanism); it will be interesting to ob- 
serve whether these different types of arguments lead to a common 
point. 


This work has been aided by United States Public Health Service 
Grant RG-5181C2. 


LITERATURE 


Benzer, S. and E. Freese. 1958. ‘‘Induction of Specific Mutations with
5-Bromouracil.’’ Proc. Nat. Acad. Sci. 44, 112-119.

Chayen, J. 1959. ‘‘The Quantitative Cytochemistry of DNA and its Significance
in Cell Physiology and Heredity.’’ Experimental Cell Research,
Supplement #6, 115-131.

Friedrichs, K. O. 1948. ‘‘On the Perturbation of Continuous Spectra.’’
Comm. on Applied Math. 1, 361-406.

Goldschmidt, R. 1955. Theoretical Genetics. Berkeley: The University
of California Press.

Jacob, F. and E. L. Wollman. 1957. ‘‘Genetic Aspects of Lysogeny.’’
The Chemical Basis of Heredity (ed. W. D. McElroy and B. Glass),
468-500.

Kadison, R. V. 1958. ‘‘Theory of Operators, Part II. Operator Algebras.’’
Bull. Am. Math. Soc. 64 (Number 3, Part II), 61-85.

von Neumann, J. 1955. Mathematical Foundations of Quantum Mechanics.
Princeton: Princeton University Press.

Rashevsky, N. 1959. ‘‘Suggestions for a Possible Approach to Molecular
Biology.’’ Bull. Math. Biophys. 21, 309-326.

Riesz, F. and B. Sz.-Nagy. 1955. Functional Analysis. New York:
Frederick Ungar Publishing Co.

Rosen, R. 1959a. ‘‘The DNA-Protein Coding Problem.’’ Bull. Math.
Biophys. 21, 71-95.

Rosen, R. 1959b. ‘‘A Relational Theory of Biological Systems II.’’ Ibid.
21, 109-127.

Rosen, R. 1959c. ‘‘Some Further Comments on the DNA-Protein Coding
Problem.’’ Ibid. 21, 289-297.

Rosen, R. 1959d. ‘‘On a Logical Paradox Implicit in the Notion of a
Self-Reproducing Automaton.’’ Ibid. 21, 387-394.

Schrodinger, E. 1944. What is Life? Cambridge: Cambridge University
Press.

Segal, I. E. 1947. ‘‘Postulates for General Quantum Mechanics.’’ Annals
of Math. 48, 930-948.

Segal, I. E. 1955. A Mathematical Approach to Elementary Particles and
their Fields. Mimeographed Lecture Notes, The University of Chicago.


RECEIVED 1-14-60 




BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 22, 1960 


FURTHER CONTRIBUTIONS TO THE MATHEMATICAL 
BIOPHYSICS OF AUTOMOBILE DRIVING 


N. RASHEVSKY 


COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


The discussions of a previous paper (Bull. Math. Biophysics, 21,
299-308, 1959) are generalized by considering that the angular direction
error made by the driver, as well as the driver’s reaction time, are not
constant but are randomly distributed. Instead of a critical speed at
which the car will jump off the road, we now find that for every speed
there is a probability that the car will jump off the road, but that this
probability is vanishingly small for sufficiently low speeds, yet increases
rapidly for high speeds. Thus a more realistic picture of the process of
driving is obtained. When the standard deviations of the distribution
functions for the angle and the reaction time are very small, the expression
obtained here reduces to the expression obtained previously.


In a previous paper (Rashevsky, 1959) we outlined an approach 
to a theory of automobile driving for the ideal case of a single car 
on an empty road. The purpose of this note is to generalize some 
of the restrictive assumptions made in that paper. 

The basic idea of the previous paper is that the driver makes on
the average an angular error $\theta$ in keeping his car to the exact
direction of the road. As, due to that error, the car approaches
sufficiently closely to the edge of the road, the driver corrects the
direction, making now the same average error in the opposite sense.
Thus the car follows not a straight line but approximately a zig-zag
line with a very small angle $\theta$.

As a first approximation we considered the angle $\theta$ as constant,
thus obtaining a ‘‘regular’’ zig-zag line. Denoting by $s$ the width
of the highway lane, by $\delta$ the smallest safe distance by which
the car can approach the edge, by $s_0$ the width of the car, by $l_0$
its length, by $v$ its speed, and by $T$ the driver’s reaction time to the
noticed deviation from the curve, we found the following inequality as a
condition of safe driving:

$$v < \frac{s - s_0 - 2\delta - l_0\theta}{\theta T}. \tag{1}$$

Actually the angle $\theta$ is not constant but is distributed in some
fashion around an average value $\bar\theta$. Let the distribution function
be $f(\theta)$ so that $f(\theta)\,d\theta$ denotes the probability that the angle lies between
$\theta$ and $\theta + d\theta$. We would at first expect $\theta$ to be always positive.
However, in principle it is conceivable that through an erroneous
act the driver, when for example approaching too closely the right
edge, turns the wheel not to the left but to the right, enhancing the
error and actually jumping off the road. The probability of such
errors must naturally be negligibly small, otherwise safe driving
would be impossible. Such an error may be formally described
as the appearance of a negative $\theta$. If we take this point of view
then $\theta$ varies from $-\infty$ to $+\infty$. We shall not, however, accept here
this point of view and we shall consider that $\theta$ is distributed in the
interval $(0, +\infty)$.
We have

$$\int_0^{+\infty} f(\theta)\, d\theta = 1. \tag{2}$$


With a variable $\theta$ the zig-zag line is no longer ‘‘regular.’’ The
angles of its different parts, as well as the length of each part,
now vary in a random fashion, governed by $f(\theta)$. For each branch
of the zig-zag line, characterized by a given value of $\theta$, the argument
used in deriving (1) holds. Inequality (1) may be written thus:

$$\theta < \frac{s - s_0 - 2\delta}{vT + l_0}. \tag{3}$$
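The step from (1) to (3) can be spelled out as follows. This is a sketch which assumes that inequality (1) has the form $v < (s - s_0 - 2\delta - l_0\theta)/(\theta T)$ and that $\theta$, $T$ and $vT + l_0$ are positive, so that multiplying and dividing by them preserves the sense of the inequality.

```latex
v < \frac{s - s_0 - 2\delta - l_0\theta}{\theta T}
\;\Longleftrightarrow\;
v\theta T + l_0\theta < s - s_0 - 2\delta
\;\Longleftrightarrow\;
\theta < \frac{s - s_0 - 2\delta}{vT + l_0}.
```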


Moreover, from what we said above, we must have $\theta > 0$. Hence

$$0 < \theta < \frac{s - s_0 - 2\delta}{vT + l_0}. \tag{4}$$


Since we now consider $\theta$ as varying in the interval $(0, +\infty)$,
strictly speaking, we should not use the second inequality of
(4), which is obtained by putting in expression (3) of loc. cit.
$\sin\theta = \tan\theta = \theta$. However, it is physically clear that if $\theta$
becomes so large that the above approximation does not hold, then
the car will jump off the road. This means that $f(\theta)$ must have a
sufficiently small standard deviation, so that for values of $\theta$ for
which the approximation does not hold, $f(\theta)$ becomes negligible.
This justifies the argument which follows. However, if we wish to
use the exact expression, the method used here is still applicable,
except that we cannot obtain simple closed expressions.

If for any part of the zig-zag line inequality (4) does not hold,
then at the end of that part the car will jump off the pavement.
The probability that this will happen is given by the probability
that (4) is not satisfied. Conversely, the probability $P_s$ that
(4) is satisfied is equal to the probability that no accident due to
the car leaving the road will occur. But this probability is given
by

$$P_s = \int_0^{\frac{s - s_0 - 2\delta}{vT + l_0}} f(\theta)\, d\theta. \tag{5}$$


The integral in (5) is a function of $(s - s_0 - 2\delta)/(vT + l_0)$ and
of the parameters of the distribution function $f(\theta)$. Thus for a given
velocity $v$ there is a probability of safe driving. Since the upper
limit of the integral in (5) is always positive and decreases with
increasing speed $v$, $P_s$ decreases as $v$ increases.

Instead of having, as before, a sharply defined critical speed,
above which the car is bound to leave the road, we now find that
with increasing speed $v$, the probability $P_a = 1 - P_s$ of the car
leaving the road increases. But there always remains a finite
probability of ‘‘getting away’’ with impunity even at excessive
speeds. This is a much more realistic picture. Notice that only
for $v = +\infty$ does $P_s$ become zero.
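The dependence of the safe-driving probability on speed can be sketched numerically for any assumed distribution. The example below evaluates the integral in (5) by the trapezoidal rule for a hypothetical density $f(\theta) = (4\theta/\bar\theta^2)\,e^{-2\theta/\bar\theta}$, chosen here only because it vanishes at $\theta = 0$, is normalized on $(0, +\infty)$ and has mean $\bar\theta$; it is not the distribution used in the original. The function names and the parameter values (metres, seconds, radians) are likewise illustrative assumptions.

```python
import math


def upper_limit(v, T, l0, clearance):
    """Upper limit of the integral in (5): (s - s0 - 2*delta) / (v*T + l0)."""
    return clearance / (v * T + l0)


def safe_probability(f, upper, steps=20000):
    """Composite trapezoidal estimate of the integral of f over (0, upper)."""
    h = upper / steps
    total = 0.5 * (f(0.0) + f(upper))
    total += sum(f(i * h) for i in range(1, steps))
    return total * h


theta_bar = 0.02  # mean angular error in radians (illustrative)


def f(theta):
    # Hypothetical density with mean theta_bar; NOT the f of the original.
    return 4.0 * theta / theta_bar**2 * math.exp(-2.0 * theta / theta_bar)


for v in (10.0, 30.0, 60.0):  # speeds in m/s
    u = upper_limit(v, T=0.2, l0=3.0, clearance=0.9)
    p_safe = safe_probability(f, u)
    print(f"v={v:5.1f} m/s  limit={u:.4f} rad  P_s={p_safe:.6f}")
```

For this particular density the integral is also available in closed form, $1 - (1 + 2u/\bar\theta)\,e^{-2u/\bar\theta}$ with $u$ the upper limit, which gives a convenient check on the quadrature.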

If the standard deviation of $f(\theta)$ is very small, then $f(\theta)$ is very
small for any value of $\theta$ which differs appreciably from the average
value $\bar\theta$. Then, as long as

$$\bar\theta < \frac{s - s_0 - 2\delta}{vT + l_0}, \tag{6}$$

$P_s$ will be practically equal to 1; in other words, the conditions
of safe driving will be satisfied. But (6) is identical with (1), if
we identify $\bar\theta$ with $\theta$. Thus our previous approach is a limiting
case of the present more general one.




As an example which enables us to make some numerical estimates
let us assume that in the interval $(0, \bar\theta)$ the function
$f(\theta)$ is such that $f(0) = 0$; $[df(\theta)/d\theta]_{\theta=\bar\theta} = 0$, and


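$$\int_0^{\bar\theta} f(\theta)\, d\theta = \frac{1}{2}, \tag{7}$$

whereas

$$\text{for } \theta > \bar\theta: \qquad f(\theta) = \frac{1}{\sqrt{\pi}\,\sigma}\, e^{-\frac{(\theta - \bar\theta)^2}{\sigma^2}}. \tag{8}$$

Such a choice of $f(\theta)$ satisfies condition (2).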

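If we denote by $\Phi(z)$ the probability integral, then from
equations (5) to (8) we find:

$$P_s = \frac{1}{2} + \frac{1}{2}\,\Phi\!\left(\frac{\dfrac{s - s_0 - 2\delta}{vT + l_1} - \bar\theta}{\sigma}\right). \tag{9}$$

Hence

$$P_a = 1 - P_s = \frac{1}{2}\left[1 - \Phi\!\left(\frac{\dfrac{s - s_0 - 2\delta}{vT + l_1} - \bar\theta}{\sigma}\right)\right]. \tag{10}$$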

If the probability of an accident is $P_a$, then an accident will
occur on the average after every $1/P_a$ zig-zags. The average distance
traveled during one zig-zag is, according to (4) of loc. cit.,

$$\bar z = \frac{s - s_0 - 2\delta - \bar\theta\, l_1}{\bar\theta}. \tag{11}$$

Hence per unit distance we have

$$n = \frac{\bar\theta}{s - s_0 - 2\delta - \bar\theta\, l_1} \tag{12}$$

zig-zags. If we wish to impose a "practically" safe driving for a
total driving distance $L$, then we must require that

$$P_a\, n L \ll 1 \tag{13}$$

or

$$P_a \ll \frac{1}{nL}. \tag{14}$$



The average distance $L$ that a car will travel without jumping off
the road is approximately

$$L = \frac{\bar z}{P_a}. \tag{15}$$

Nothing can be said about the validity of expression (10) until
we know the values of $\bar\theta$, $T$, and $\sigma$. Let us see what we
obtain by making plausible assumptions. Assume $\bar\theta$ to be of the
order of magnitude of 1°. In other words $\bar\theta = 0.02$. Let
$\sigma = 0.01$, and $T = 0.2$ sec. For $l_1$ we should take the wheelbase,
for $s_0$ the distance between the two rear or the two front wheels.
Take $l_1 = 300$ cm, $s_0 = 150$ cm, $\delta = 30$ cm, and $s = 300$ cm. Then
$s - s_0 - 2\delta = 90$ cm. Take $v = 30$ m sec$^{-1}$, which is about 70 mph.
Then the argument of $\Phi$ is 8. For such large values of the argument
the order of magnitude of $1 - \Phi(z)$ is given by (Jahnke-Emde, 1928):


$$1 - \Phi(z) = \frac{e^{-z^2}}{z\sqrt{\pi}}. \tag{16}$$


Hence with the above values we find from (10) that $P_a \sim 10^{-29}$.
For this case we have $n \sim 1/4200$. From (15) we have
$L \sim 4 \times 10^{32}$ cm $= 4 \times 10^{27}$ km. The driving on an
empty road would be quite safe under these conditions.

If the speed $v$ is doubled, becoming about 140 mph, then the
argument of $\Phi$ becomes equal to 4. Hence $P_a \sim 10^{-8}$, and
$L \sim 10^{12}$ cm $= 10^{7}$ km. The driving is still safe. If, however,
for this speed, 60 m sec$^{-1}$, we would take a narrower pavement, say
$s = 270$ cm, then $s - s_0 - 2\delta = 60$ cm, and we find $P_a = 0.0023$.
With the above value of $s$ we have $n = 1/2700$, and therefore
$L \sim 1.2 \times 10^{6}$ cm $= 12$ km. Thus a car with the assumed
dimensions traveling at 216 km hour$^{-1}$ on a pavement 300 cm wide
could do it quite safely, while on a pavement only 30 cm narrower, it
would jump off the pavement about every 12 km. At half that speed,
that is at 30 m sec$^{-1}$, even on the narrower pavement we find
$L \sim 10^{9}$ km and the driving quite safe.
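
The numerical estimates above can be checked with a short script. The
following is only a sketch, under the assumption that the probability
integral is the error function (Python's `math.erf`), which is the
reading consistent with the quoted value 0.0023 for an argument of 2.
The function names are illustrative and not part of the original paper;
lengths are in cm and speeds in cm per sec. Because the exact
complementary error function is used rather than the asymptotic form,
the results may differ slightly from the rounded orders of magnitude
quoted in the text.

```python
import math

def accident_probability(v, s, T=0.2, l1=300.0, s0=150.0, delta=30.0,
                         theta_bar=0.02, sigma=0.01):
    """Probability of leaving the road per zig-zag, and the argument z.

    Assumes the probability integral is erf, so that
    P_a = (1/2) * [1 - erf(z)] = (1/2) * erfc(z).
    """
    upper = (s - s0 - 2.0 * delta) / (v * T + l1)  # largest admissible angle
    z = (upper - theta_bar) / sigma
    return 0.5 * math.erfc(z), z

def zigzag_length(s, l1=300.0, s0=150.0, delta=30.0, theta_bar=0.02):
    """Average distance travelled during one zig-zag, in cm."""
    return (s - s0 - 2.0 * delta - theta_bar * l1) / theta_bar

def mean_safe_distance_km(v, s):
    """Average distance without leaving the road, converted from cm to km."""
    p_a, _ = accident_probability(v, s)
    return zigzag_length(s) / p_a / 1.0e5

if __name__ == "__main__":
    cases = [(3000.0, 300.0), (6000.0, 300.0), (6000.0, 270.0), (3000.0, 270.0)]
    for v, s in cases:
        p_a, z = accident_probability(v, s)
        print(f"v = {v / 100:.0f} m/sec, s = {s:.0f} cm: z = {z:.2f}, "
              f"P_a = {p_a:.2e}, L = {mean_safe_distance_km(v, s):.2e} km")
```

The four cases are the wide and narrow pavements at 30 and 60 m per sec
discussed in the text.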

We may also consider the case in which $T$ is distributed around
an average value $\bar T$. Let the distribution function of $T$ be $u(T)$.
We now write expression (1) as follows:

$$(vT + l_1)\,\theta < s - s_0 - 2\delta. \tag{17}$$




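The probability that $\theta$ is between $\theta$ and $\theta + d\theta$,
while $T$ is between $\tau$ and $\tau + d\tau$ is

$$F(\theta, \tau) = f(\theta)\, u(\tau)\, d\theta\, d\tau. \tag{18}$$

The probability $P_s$ that (17) is satisfied is given by the integral
of $F(\theta, \tau)$ over the area bounded by the coordinate axes
$\theta = 0$, $\tau = 0$, and the curve

$$(v\tau + l_1)\,\theta = s - s_0 - 2\delta. \tag{19}$$

Hence

$$P_s = \int_0^{\infty} u(\tau) \left[\int_0^{\frac{s - s_0 - 2\delta}{v\tau + l_1}} f(\theta)\, d\theta\right] d\tau. \tag{20}$$

Again we obtain $P_s$ as a function of $v$ and of the other parameters.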
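
To illustrate how a spread in the reaction time changes the result, the
double integral can be evaluated numerically. The sketch below rests on
assumptions that are not in the original paper: the distribution $u(\tau)$
is taken to be a Gaussian around 0.2 sec, truncated to positive times
and renormalized numerically; the angle distribution is the Gaussian
tail example used for the numerical estimates; and the probability
integral is read as the error function. The accident probability
$1 - P_s$ is integrated directly to avoid loss of precision.

```python
import math

def p_accident_given_tau(tau, v, s, l1=300.0, s0=150.0, delta=30.0,
                         theta_bar=0.02, sigma=0.01):
    """Accident probability for a fixed reaction time tau (sec); cm units."""
    upper = (s - s0 - 2.0 * delta) / (v * tau + l1)
    if upper < theta_bar:
        # The example distribution is only specified above the mean angle.
        raise ValueError("upper limit falls below the mean angle")
    return 0.5 * math.erfc((upper - theta_bar) / sigma)

def p_accident_distributed(v, s, t_mean=0.2, t_std=0.05, n=4001):
    """Trapezoidal average of the accident probability over u(tau).

    u(tau) is a hypothetical Gaussian, truncated to tau > 0 and
    renormalized on the integration window.
    """
    lo = max(t_mean - 6.0 * t_std, 1.0e-6)
    hi = t_mean + 6.0 * t_std
    h = (hi - lo) / (n - 1)
    num = 0.0
    norm = 0.0
    for i in range(n):
        tau = lo + i * h
        w = 0.5 if i in (0, n - 1) else 1.0          # trapezoidal end weights
        u = math.exp(-0.5 * ((tau - t_mean) / t_std) ** 2)
        num += w * u * p_accident_given_tau(tau, v, s)
        norm += w * u
    return num / norm

if __name__ == "__main__":
    fixed = p_accident_given_tau(0.2, 6000.0, 300.0)
    spread = p_accident_distributed(6000.0, 300.0)
    print(f"fixed T: {fixed:.2e}; distributed T: {spread:.2e}")
```

With these assumed values, the long reaction times in the tail of
$u(\tau)$ dominate the average, so the accident probability comes out
far larger than for a fixed reaction time equal to the mean.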


This work was aided by a grant from the Dr. Wallace C. and Clara
A. Abbott Memorial Fund of The University of Chicago.


LITERATURE 


Jahnke, E. and Emde, F. 1928. Funktionentafeln mit Formeln und Kurven. 
Leipzig: Teubner. 

Rashevsky, N. 1959. "Some Remarks on the Mathematical Aspects of
Automobile Driving." Bull. Math. Biophysics, 21, 299-308.


RECEIVED 3-3-60 


BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 22, 1960 


A CONTRIBUTION TO THE MATHEMATICAL BIOLOGY OF 
AUTOMOBILE DRIVING: II. PASSING AS A CASE OF 
PSYCHOPHYSICAL DISCRIMINATION 


N. RASHEVSKY 
COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


The decision to pass or not to pass in view of an oncoming car is
considered as a case of comparative judgment in which it is to be
decided whether the time it will take to pass safely is greater or less
than the time it will take to collide with the oncoming car. H. D.
Landahl's well-known theory of psychophysical discrimination is used,
and it is assumed that the "distracting stimuli" considered previously
(Rashevsky, 1959, Bull. Math. Biophysics, 21, 375-85) tend to increase
the standard deviation of Landahl's fluctuation function. Effects of
the "distracting stimuli" on the threshold of the neuroelements in
Landahl's circuit are also considered. On this basis an expression is
derived which gives the probability of a collision accident in passing
as a function of the "distracting stimuli."


This paper is a continuation of a series of previous papers
(Rashevsky, 1959a,b; 1960b; hereinafter referred to as I, II, and III).
We shall discuss here a possible neurobiophysical model of the
behavior of the driver while passing another car when at the same
time an oncoming car is in view. The process of behavior is actually
very complex. We shall however again use a highly oversimplified
abstraction. We may consider the reaction of the driver as one of
psychophysical judgment. In effect, the driver must make the decision
whether he will have enough time to pass safely or whether an attempt
to pass will lead to a collision with the oncoming car. This is not
the clear-cut and simple case of deciding which of two given
quantities is the greater (or smaller). Yet the analogy between the
two processes is quite pronounced. It is therefore natural to attempt
to apply H. D. Landahl's (1938, 1939, 1940; for a brief exposition see
Rashevsky, 1959b, 1960a) well-known neurobiophysical theory of
psychophysical discrimination to this case.

According to this picture, when the time needed for safe passing
is almost equal to the time which it will take for the given car to
collide with the oncoming car, the probability of a wrong judgment
and therefore of an accident will be nearly 1/2 if the threshold in
Landahl's circuit is neglected and if therefore only two categories
of judgment, namely "correct" or "wrong," are possible. Under
these conditions the number of accidents would be tremendous.
Therefore it is clearly necessary to consider Landahl's general
case, in which the threshold is finite and a "no judgment" situation
is possible. When the driver is unable to make a decision, he
does not attempt to pass. The effect is the same as a negative
decision. However, the introduction of the "no judgment" situation
reduces the probability of the "wrong" response.

Denoting by $h^*$ the threshold of the system, by $p(A)$ Landahl's
distribution function of threshold fluctuations, and by $\phi$ the
difference between the actual time necessary for safe passing and the
actual time necessary for collision ($\phi > 0$ means "unsafe"), we
have for the probability $P_w$ of a "wrong" response (that is, of
estimating the conditions for passing as safe while they actually are
not safe):

$$P_w = \int_{-\infty}^{-(h^* + \phi)} p(A)\, dA. \tag{1}$$

Let us consider the effects of "distracting stimuli" (II) such as
road shocks, road signs, noise, and, not the least, other cars on the
highway. As we have seen in I, different distracting stimuli depend
in a different manner on the speed $v$. First of all, according to
the general picture used in the mathematical biology of the central
nervous system and discussed in II, any distracting stimulus will
increase the threshold $h^*$. This will result, as seen from (1), in a
decrease of $P_w$. Thus we seem to arrive at the absurd result that
the probability of making an error in passing decreases with
increasing distraction. The physical meaning of this effect is that
the driver, because of increased $h^*$, increases his "no judgment"
range and will thus exercise more care in avoiding passing in
dubious cases. That such an increase in carefulness due to increase
in distractions may exist is quite plausible.

This effect will however be upset by the effect of distracting
stimuli on the function $p(A)$. From the nature of this function it
is to be expected that any exogenous distracting stimuli will tend to
increase the standard deviation of $p(A)$ (Rashevsky, 1959b, 1960a).
But an increase of the standard deviation of $p(A)$ will increase $P_w$.
In the interpretation of some experimental data H. D. Landahl
used for $p(A)$ either a normal distribution function or the function

$$p(A) = \frac{k}{2}\, e^{-k|A|}. \tag{2}$$

Both lead to very satisfactory results. Because of the much easier
way of obtaining closed expressions with the function (2), we shall
use the latter.

In this case we have (Rashevsky, 1959b) for $\phi > 0$:

$$P_w = \frac{1}{2}\, e^{-k(h^* + \phi)}. \tag{3}$$

The general expression (1) gives us the probability of an error
for a given $\phi$. Thus we have $P_w = P_w(\phi)$. Different values of
$\phi$ may occur in general with different frequency on the highway.
Thus for a straight stretch of road with good visibility, all values
of $\phi$ may be equally probable. On a road with many turns higher
values of $\phi$ may become more likely. If $f(\phi)$ is the distribution
function of $\phi$, such that

$$\int_{-\infty}^{+\infty} f(\phi)\, d\phi = 1, \tag{4}$$

then the total probability of a wrong judgment is

$$P_w = \int_{-\infty}^{+\infty} P_w(\phi)\, f(\phi)\, d\phi. \tag{5}$$

We must keep in mind that when $\phi < 0$, that is, when it is safe
to pass, a wrong response does not lead to an accident. Hence in
estimating the probability of an accident, we should use not the
quantity $P_w$, but a different quantity, namely one obtained by
extending the integration over positive values of $\phi$ only. This
quantity

$$P_w^* = \int_0^{+\infty} P_w(\phi)\, f(\phi)\, d\phi \tag{6}$$

gives the probability of collision accidents in passing.




To illustrate the above, let us consider the case in which
$f(\phi) = 1$. Then, from (6) and (3) we obtain

$$P_w^* = \frac{1}{2k}\, e^{-k h^*}. \tag{7}$$

If $D$ denotes a distracting stimulus, $h_0^*$ and $k_0$ the values of
$h^*$ and $k$ in the absence of $D$, and if $a_1$ and $a_2$ are factors of
proportionality, then, assuming as a first approximation a linear
variation of $k$ with $D$ we have, according to what was said above:

$$h^* = h_0^* + a_1 D; \qquad k = k_0 - a_2 D. \tag{8}$$

Introducing this into (7) we find:

$$P_w^* = \frac{e^{-(k_0 - a_2 D)(h_0^* + a_1 D)}}{2\,(k_0 - a_2 D)}. \tag{9}$$

If the linear approximation (8) for $k$ would hold for all values of
$D$, then $P_w^*$ would become infinite for $D = k_0/a_2$. This cannot
happen since $P_w^* < 1$. Within a small range of variation of $D$ we
see, however, that $P_w^*$ may first decrease with $D$ if $a_1$ is
sufficiently large but will again increase due to a decrease of $k$.
The first phase represents the above-mentioned increase of caution.
The second phase is an effect in the opposite direction, which
increases $P_w^*$ in spite of increased caution.
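
The two phases can be seen by tabulating the collision probability
against the distracting stimulus. The sketch below assumes the
exponential fluctuation function and a uniform $f(\phi) = 1$, so that the
probability is $e^{-k h^*}/(2k)$ with the threshold rising and $k$
falling linearly in $D$. The parameter values are purely illustrative
and are not taken from the paper. Since $f(\phi) = 1$ is not normalized
and the linear law for $k$ fails near $D = k_0/a_2$, the computed values
are not bounded by 1 there.

```python
import math

def p_collision(D, k0=1.0, h0=1.0, a1=1.0, a2=0.1):
    """Collision probability for a distracting stimulus D.

    Assumes threshold h = h0 + a1*D and k = k0 - a2*D (linear
    approximation), with illustrative parameter values.
    """
    k = k0 - a2 * D
    if k <= 0.0:
        raise ValueError("linear approximation breaks down: k <= 0")
    h = h0 + a1 * D
    return math.exp(-k * h) / (2.0 * k)

# Scan D from 0 to 9.8, staying below k0/a2 = 10.
grid = [0.1 * i for i in range(99)]
values = [p_collision(D) for D in grid]
d_min = grid[values.index(min(values))]

if __name__ == "__main__":
    print(f"P at D = 0: {p_collision(0.0):.4f}")
    print(f"minimum near D = {d_min:.1f}: {min(values):.4f}")
    print(f"P at D = 9: {p_collision(9.0):.4f}")
```

With these values the probability first falls (the phase of increased
caution) and then rises again as $k$ decreases.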

For $D$ we may substitute any of the distracting stimuli discussed
in II or their combinations. Similar effects will be obtained by
considering $D$ as due to fatigue effects (II).

Any experimental verification of expressions similar to (9) under 
actual road conditions is of course precluded. It may be possible, 
however, to design a simulator for conditions of passing; then an 
experimental study of (9) or other expressions derived from similar 
considerations is likely to lead us eventually to a better under- 
standing of the neurobiophysics of passing. 


This work was aided by a grant from the Dr. Wallace C. and 
Clara A. Abbott Memorial Fund of The University of Chicago. 


LITERATURE 


Landahl, H. D. 1938. "A Contribution to the Mathematical Biophysics
of Psychophysical Discrimination." Psychometrika, 3, 107-125.

Landahl, H. D. 1939. "A Contribution to the Mathematical Biophysics of
Psychophysical Discrimination II." Bull. Math. Biophysics, 1, 159-76.

Landahl, H. D. 1940. "A Contribution to the Mathematical Biophysics of
Psychophysical Discrimination III." Ibid., 2, 73-87.

Rashevsky, N. 1959a. "Some Remarks on the Mathematical Aspects of
Automobile Driving." Bull. Math. Biophysics, 21, 299-308.

Rashevsky, N. 1959b. Mathematical Biology of Social Behavior. Revised Ed.
Chicago: University of Chicago Press.

Rashevsky, N. 1959c. "Mathematical Biophysics of Automobile Driving."
Bull. Math. Biophysics, 21, 375-85.

Rashevsky, N. 1960a. Mathematical Biophysics, II. 3rd Ed. New York: Dover
Publications, Inc.

Rashevsky, N. 1960b. "Further Contributions to the Mathematical Biophysics
of Automobile Driving." Bull. Math. Biophysics, 22, 257-262.


RECEIVED 1-26-60 



BULLETIN OF
MATHEMATICAL BIOPHYSICS
VOLUME 22, 1960 


THE FLUX-FORCE RELATIONS ACROSS COMPLEX 
MEMBRANES WITH ACTIVE TRANSPORT 


BRADLEY T. SCHEER 
DEPARTMENT OF BIOLOGY 
UNIVERSITY OF OREGON, EUGENE, OREGON 


Equations are derived for the total material flux, and the total electric 
current flux, across a complex membrane system with active transport. 
The equations describe the fluxes as linear functions of forces across 
the system, and specifically of electrical potential, hydrostatic pressure, 
chemical potentials, and active transport rates. The equations can be 
simplified for experimental studies by making one or more of the forces 
equal to zero. The osmotic pressure difference across a membrane sys- 
tem is shown to be a function of the electrical potential and chemical 
potential differences and of the active transport rates. The transmem- 
brane potential is shown to be the sum of a diffusion potential and an 
active transport potential. A simple equation is derived describing the 
current across a membrane as a linear function of the electrical potential 
and the active transport rate. Specific examples of the application of the 
equations to nerve membrane potentials are considered. 


In a previous paper I developed a definition of active transport 
in terms of a complex membrane model and the concepts of the 
thermodynamics of irreversible processes (Scheer, 1958). In the 
present paper, a similar model system is used to derive equations 
relating material flux and electric current flux to differences in 
pressure, electrical potential and chemical potential across the 
system, using the approach which Staverman (1952) has developed 
for a simple membrane without active transport. The basic equa- 
tions then lead to a variety of useful relations suitable for experi- 
mental study of membrane processes generally and of active trans- 
port processes in particular. 

We shall consider a complex membrane system of finite thickness,
with parallel faces bathed in external media 0 and $x$, each of
which includes constituents $1, 2, \ldots i, j, \ldots n$. We shall examine
the flux-force relations across an element of this membrane system
extending from 0 to $x$ at right angles to the faces, and of unit
cross-sectional area. We shall assume that, within the element
under consideration, there are successive elements of volume
$p_1, p_2, \ldots p_\alpha, p_{\alpha+1}, \ldots p_{\alpha'}, p_{\alpha'+1}, \ldots$,
each homogeneous, and separated from the external phases $p_0$ and
$p_x$ by phase boundaries, which we shall consider to be thin
membranes $m_1, m_2, \ldots m_\alpha, m_{\alpha+1}, \ldots m_{\alpha'}, m_{\alpha'+1}, \ldots m_x$,
as in the diagram:

[Diagram: the sequence of phases $p$ alternating with the thin
membranes $m$, from $p_0$ to $p_x$.]

This system may, with appropriate modifications, be taken as
equivalent to many biological membrane systems of known structure;
the simplest case is evidently a single thin membrane $m_1$
separating phases $p_0$ and $p_x$, as in the cell membrane of most
cells. A more complex situation is that involving phases $p_0$, $p_1$,
$p_x$ and thin membranes $m_1$, $m_2$ as in the cytoplasmic membrane
system separating vacuole from external medium in plant cells.
Systems such as frog skin are probably still more complex.

Let the flux of a constituent $i$ of the external phases $p_0$ and
$p_x$ be the net number of moles $dn_{ix}$ of $i$ entering $p_x$ in time
$dt$, or the net number of moles $-dn_{i0}$ leaving $p_0$ in time $dt$,
and let it be designated by

$$J_i = \frac{dn_{ix}}{dt} = -\frac{dn_{i0}}{dt}. \tag{1}$$

Let the flux of $i$ in the same direction from $p_0$ towards $p_x$
across any thin membrane $m$ be $J_{im}$.

The constituents of the external phases $p_0$ and $p_x$ may enter
into chemical reactions within the system. We shall consider only
those reactions which contribute to the active transport of the
constituent as defined earlier (Scheer, 1958). Two cases of such
transport reactions will be considered, namely a mobile carrier
mechanism and a fixed carrier mechanism.
In the mobile carrier mechanism, the transported constituent 7 


enters into a reaction « in some phase p, and is transformed into k. 
We may write the reaction thus: 


Citta... —> paises (2) 




Here v represents a stoichiometric coefficient, not necessarily in- 
tegral, and the reaction may involve reactants and products other 
than z and &, and may occur in one or more steps. We shall be con- 
cerned only with the reactant z and its final product & and their 
stoichiometric ratio v. For each such reaction c, there must be in 


some other phase p,’ reciprocal reaction «’ in which ¢ is regener- 
ated from k, thus: 


a 


FR Me Se ee Se ee (8) 


We shall assume that the product $k$ remains entirely within the
system, and does not appear in the external phases $p_0$ or $p_x$. To
provide for active transport, reactions 2 or 3 or both must be
coupled with other reactions, or alternatively the reactants and
products of 2 must be different from those of 3, to provide for a
net increase in entropy in each reaction, and to compensate for
the entropy decrease connected with the active transport, as
described earlier (Scheer, 1958).

Let the velocity of reaction « be the net number of moles vdn,, 
of product & formed, or the net number of moles —dn,, of reactant ¢ 
used in reaction « in time dt, and be designated by 


dt dt (4) 


| Ge aa OE ae ae 28 (5) 


We now assume that the system operates in a stationary state.
The condition is that, for any phase other than the external phases
$p_0$ and $p_x$,

$$\frac{dn_i}{dt} = 0 \qquad (i = 1, 2, \ldots n). \tag{6}$$


From this condition it follows, for those phases $p_1 \ldots p_\alpha$ and
$p_{\alpha'+1} \ldots p_x$ which are not concerned in the processes
involved in active transport, that

$$J_i = J_{im_1} = \cdots = J_{im_\alpha} = J_{im_{\alpha'+1}} = J_{im_{\alpha'+2}} = \cdots = J_{im_x}. \tag{7}$$




It also follows that 


8 
Via Vy’ ( ) 
and 
> 
= 9 
Vi= kmos14 . VI imo’ ( ) 
and 
> > > ; 
AF con Eg ee eee kmay “vs 
> > > (10) 
SY et ES em img’ 44 


and, for the whole system, combining 7 and 10, 


> > > ; ; 
Af: ° natal ee = s al = 
i=Jin, = img imay4 Vv km as (11) 
4 > 7 -> 
= Jima’ t Md poke = im a’s1 =.e- = ot ima 


In accordance with the principles of the thermodynamics of
irreversible processes (see Prigogine, 1955), we can express any flux
$J_i$ in the relation

$$J_i = \sum_{j=1}^{n} L_{ij} X_j \tag{12}$$

where $X_j$ represents the thermodynamic forces acting upon the
constituents $j$. In the present considerations, both $J_i$ and $X_j$
are vectorial. The coefficients $L_{ij}$ are considered to be linear in
the neighborhood of equilibrium, and to obey the Onsager reciprocity
rule

$$L_{ij} = L_{ji}. \tag{13}$$

The linearity and reciprocity of the coefficients have been verified
experimentally in a number of cases sufficient to permit reasonable
confidence in this assumption (Miller, 1960).
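
As a concrete reading of the linear flux-force relation and the
reciprocity rule, the sketch below evaluates fluxes from a small
symmetric coefficient matrix. The numerical values of the coefficients
and forces are hypothetical, in arbitrary units, and serve only to show
the bookkeeping; nothing here is taken from measured data.

```python
def fluxes(L, X):
    """Return the fluxes J_i = sum over j of L[i][j] * X[j]."""
    return [sum(L_ij * X_j for L_ij, X_j in zip(row, X)) for row in L]

def is_onsager_symmetric(L, tol=1e-12):
    """Check the reciprocity rule L[i][j] == L[j][i] within a tolerance."""
    n = len(L)
    return all(abs(L[i][j] - L[j][i]) <= tol
               for i in range(n) for j in range(n))

# Hypothetical coefficients for three constituents (arbitrary units).
L = [[2.0, 0.5, 0.1],
     [0.5, 1.0, 0.3],
     [0.1, 0.3, 4.0]]
X = [1.0, -2.0, 0.5]   # hypothetical thermodynamic forces

J = fluxes(L, X)

if __name__ == "__main__":
    print("reciprocity holds:", is_onsager_symmetric(L))
    print("fluxes:", J)
```

The off-diagonal coefficients carry the coupling: a force on one
constituent drives a flux of another, and reciprocity makes that
coupling the same in both directions.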

In accordance with 12, we may write for m,, for example, 


> j=n = 


wi = oe Ens ial dik (14) 

j=l 

igh 
and similarly for each thin membrane not in that part of the system 
concerned in the active transport processes. For m,,, and succeed- 




ing membranes to m,Wwe have 


E juk,n 

et = Piet it ime 
j=l (15) 
jan 4 A 

: Ds eames, mes i SP Sre Cae 
jak 
and 

+ = > > 

Yaa ¥ DF D mces amis) ‘: Cikmayt be ne ’ a 
jak 


By substitution of the sums in the form of 14-16 in 11, we may 
then obtain by addition 


he M=M x j=un ae m=Mq?’ > 
ec), = pe >, L iim dim * 2 bi att 
m=my, j=1 m=Mqi il 
jek 
m=mM q’ 
v Linm Xo (17) 
m=amaq+t 


m=mq41 j=l 
if 


where $x$ is the total number of thin membranes between $p_0$ and $p_x$.
We may now define a set of mean coefficients $L_{ij}$ for the whole
system by


ie : (18) 




Similarly we define, for the region from p, to p_.,, 


Eee (19) 


and 


To. ite ble Sere (20) 


and 


Li, = —__.. (21) 


Ma+1 


In general, the thermodynamic force differential across the entire 
system will be 


x, . F ae (22) 


X ae eee (23) 
Maal 
We can therefore write 17 as 
> ale > jun > > > 
Jem ey Xt V Li, X54 Li, X41 +04,X4. (24) 
j=1 j=l 


idk ink 




This may be written more simply as

$$J_i = \sum_j L_{ij} X_j + J_i^A \tag{25}$$

where $J_i^A$ is the active flux of $i$, given by

> oe = > > 

pa Lh Xt ea tt (26) 

jak 

The first sum here represents the effects of the forces X’ on the 
flux of &; the second term represents the effects of the forces x 
on the flux of 7; the third term represents the effects of the feeek 
x on the flux of & Since all of these relations arise out of the 
reactions in which $k$ is produced and consumed, and have an effect
in the total flux of $i$, they may be properly considered as
contributing to the active flux of $i$.

In the fixed carrier mechanism, an example of which has been
discussed by Koblick (1959), the product of reaction 2 is bound to
the structure of the system and does not undergo translational
motion. Consequently, reaction 3 must occur in the same locus as
2. It may be assumed that, for such a mechanism, both reactions
occur in a single thin membrane such as $m_\alpha$. The stationary state
condition then gives us

a > 


ed ee SOS aL eee (277) 
t t My 


m4 img t ima+1 
and a general equation similar to 25 can be derived as before for
this mechanism. The exact form of $J_i^A$ will depend on the
mechanism assumed.

The total material flux in moles per sec across the system, $J$,
is now given by

$$J = \sum_{j=1}^{n} v_j J_j = \sum_{j=1}^{n} v_j \sum_{i=1}^{n} L_{ij} X_i + \sum_{j=1}^{n} v_j J_j^A \qquad (i \ne k,\ j \ne k), \tag{28}$$

where $v_j$ is the partial molal volume of $j$.

Similarly, the flux of electric current $I$ is given by

$$I = \sum_{j=1}^{n} z_j J_j = \sum_{j=1}^{n} z_j \sum_{i=1}^{n} L_{ij} X_i + \sum_{j=1}^{n} z_j J_j^A \qquad (i \ne k,\ j \ne k). \tag{29}$$

Here $z_j$ is the electrochemical valence of $j$, including the faraday $F$.
We may now, following Staverman (1952), define the following:
Mechanical permeability $L_P$:

$$L_P = \sum_i \sum_j L_{ij} v_i v_j \tag{30}$$

Electrical permeability $L_E$:

$$L_E = \sum_i \sum_j L_{ij} z_i z_j \tag{31}$$

Electromechanical permeability $L_{EP}$:

$$L_{EP} = \sum_i \sum_j L_{ij} z_i v_j \tag{32}$$

Let the only thermodynamic forces operative be the hydrostatic
pressure $P$, the electric potential $E$, and the chemical potentials
$\Delta\mu_i$.
Then

$$X_i = v_i P + z_i E + \Delta\mu_i, \tag{33}$$

where

$$P = P_0 - P_x; \qquad E = \psi_0 - \psi_x; \qquad \Delta\mu_i = \mu_{i0} - \mu_{ix}. \tag{34}$$


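We then define mechanical transference numbers $r_j$ and $r_j'$ by

$$r_j = \left(\frac{v_j J_j}{J}\right)_{E = 0,\ J_j^A = 0,\ \Delta\mu_j = 0} \tag{35}$$

and

$$r_j' = \frac{r_j}{v_j}. \tag{36}$$

We then have from 30 and 35

$$J = \sum_j v_j \sum_i L_{ij} v_i P = L_P P. \tag{37}$$

Hence, in general,

$$r_j = \frac{v_j \sum_i L_{ij} v_i}{L_P}. \tag{38}$$

Similarly, we define electrical transference numbers $t_j$ and $t_j'$

$$t_j = \left(\frac{z_j J_j}{I}\right)_{P = 0,\ J_j^A = 0,\ \Delta\mu_j = 0} \tag{39}$$

and

$$t_j' = \frac{t_j}{z_j} \tag{40}$$

and in general,

$$t_j = \frac{z_j \sum_i L_{ij} z_i}{L_E}. \tag{41}$$

We may then write the general flux equations 28 and 29 as

$$J = \sum_j v_j \sum_i L_{ij}\,(z_i E + v_i P + \Delta\mu_i) + \sum_j v_j J_j^A
   = L_{EP} E + L_P P + L_P \sum_j r_j' \Delta\mu_j + \sum_j v_j J_j^A \tag{42}$$

and

$$I = \sum_j z_j \sum_i L_{ij}\,(z_i E + v_i P + \Delta\mu_i) + \sum_j z_j J_j^A
   = L_E E + L_{EP} P + L_E \sum_j t_j' \Delta\mu_j + \sum_j z_j J_j^A. \tag{43}$$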


Applications to Biological Systems 


Osmotic relations: Equation (42) may be used in the study of osmotic
relations across membrane systems with active transport.
Thus, if we use the customary definition for the osmotic pressure
difference $\Pi$ across a membrane as the value of $P$ for which $J = 0$,
then (42) becomes

$$\Pi = -\frac{L_{EP}}{L_P}\, E - \sum_j r_j' \Delta\mu_j - \frac{1}{L_P} \sum_j v_j J_j^A. \tag{44}$$

The osmotic pressure so defined is evidently not a simple function
of the concentration differences across the membrane. If we recall
that, in several cases where membrane systems such as those
of frog skin, fish or crab gills separate body fluids from the
external medium, active transport processes are known to occur and
bioelectric potential differences have been observed, it is clear
that the osmotic relations across such membrane systems are not
simply determined by the differences in freezing points, for
example, of the internal and external fluids.

The determination of the coefficients $L_{EP}$ and $L_P$ in equation
(42) for living membrane systems poses difficult technical problems.
The simplest approach, applicable to some systems at least,
would appear to be in a study of the flux-potential-pressure relation
when $\Delta\mu_j = 0$, a condition which can be realized by making
the solutions on the two sides of the system (0 and $x$) identical in
composition. Equation (42) then becomes

$$J = L_{EP} E + L_P P + \sum_j v_j J_j^A. \tag{45}$$

If the actively transported substances have been identified (Scheer,
1958), it is possible to determine $J_j^A$ from a modification of (43),
as outlined below. We may also, by using an external emf in the
Ussing short-circuit technique (Ussing and Zerahn, 1951), make
$E = 0$, when (45) becomes

$$J = L_P P + \sum_j v_j J_j^A. \tag{46}$$

The value of $L_P$ can then be determined by measuring mechanical
flux as a function of pressure. A value of $L_{EP}$ can then readily
be determined by varying the electrical potential with pressure
difference at zero, when

$$J = L_{EP} E + \sum_j v_j J_j^A. \tag{47}$$

It is obvious that $J_j^A$ can be determined by holding both $E$ and $P$
at zero, when

$$J = \sum_j v_j J_j^A. \tag{48}$$

It is probable, however, that the mechanical flux under these
conditions will be small.

Current flux: Equation (43) is much more readily applicable to
biological systems. In many cases of interest, we will find the
hydrostatic pressure difference across the system to be zero, or we can
make it so experimentally. We then will have

$$I = L_E E + L_E \sum_j t_j' \Delta\mu_j + \sum_j z_j J_j^A. \tag{49}$$

If $J_j^A$ is invariant with $E$, then this equation describes a linear
relation of $I$ to $E$, the slope of which is $L_E$, the electrical
permeability or conductance of the membrane. For limited ranges of
values of $I$ and $E$, such a linear relation has been observed for the
skin and large intestine of a frog (Pumphrey, 1934; Cooperstein
and Hogben, 1959), the bladder of a toad (Leaf, Anderson, and
Page, 1957), the squid giant axon membrane (Hodgkin, Huxley and
Katz, 1952), the cell body membrane of brain cells in the puffer
fish (Hagiwara and Saito, 1959) and of spinal motor neurons in an
unspecified mammal (Frank, Fuortes and Nelson, 1959), and the
striated muscle fibers of the cat (Boyd and Martin, 1959), for
example. In the case that $\Delta\mu_j = 0$, and $J_j^A = 0$, the relation
simply reduces to Ohm's law, but when these conditions are not met, the
line relating $I$ and $E$ does not pass through the origin. The
deviations from linearity which were noted in most of the membranes
when the membrane potential $E$ was shifted far from its normal
"resting" value are of considerable importance. They may be
explained either as a change in the value of $L_E$ or $t_j'$ or in the
value of $J_j^A$. Hodgkin and Huxley (1952) assumed a change in "sodium
permeability," which might be interpreted as a change in $L_E$ or
$t_{Na}'$, but is probably, rather, a change in $J_{Na}^A$. Hagiwara and
Saito (1959) present some evidence suggesting that the active transport
process of the nerve cell membrane is in fact changed during
excitation.


Resting potentials: Equation (49) may be further simplified in
certain experimental situations. One of the most important
measurements in the study of bioelectricity is that of the resting
potential $E_R$, defined as the value of $E$ when $I = 0$, and given by

$$E_R = -\sum_j t_j' \Delta\mu_j - \frac{1}{L_E} \sum_j z_j J_j^A. \tag{50}$$

It is evident that (50) is equivalent to the classical electrochemical
equations for diffusion potentials (Hittorf equation, Nernst equation)
when $J_j^A = 0$; it is equally evident that the active transport process
makes a distinct contribution to the resting potential, and one
which is entirely independent of the diffusion potential.

The determination of transport numbers $t_j'$ may be made, with the
aid of (50), by studying the effect of changes in concentration of
the medium on one side of the membrane (e.g. in phase 0). Consider
the membrane potentials for two different values of the
chemical potentials of an anion $a$ and a cation $c$:

$$E_R^{I} = -\sum_{j \ne a, c} t_j' \Delta\mu_j - t_a'\,(\mu_{a0}^{I} - \mu_{ax}) - t_c'\,(\mu_{c0}^{I} - \mu_{cx}) - \frac{1}{L_E} \sum_j z_j J_j^A \tag{51}$$

$$E_R^{II} = -\sum_{j \ne a, c} t_j' \Delta\mu_j - t_a'\,(\mu_{a0}^{II} - \mu_{ax}) - t_c'\,(\mu_{c0}^{II} - \mu_{cx}) - \frac{1}{L_E} \sum_j z_j J_j^A \tag{52}$$

Subtracting, we obtain

$$E_R^{I} - E_R^{II} = t_a'\,(\mu_{a0}^{II} - \mu_{a0}^{I}) + t_c'\,(\mu_{c0}^{II} - \mu_{c0}^{I}) \tag{53}$$

or, since approximately

$$\mu^{II} - \mu^{I} = RT \ln \frac{c^{II}}{c^{I}}, \tag{54}$$

$$E_R^{I} - E_R^{II} = RT \left( t_a' \ln \frac{c_{a0}^{II}}{c_{a0}^{I}} + t_c' \ln \frac{c_{c0}^{II}}{c_{c0}^{I}} \right). \tag{55}$$


This type of measurement has often been made with nerve and
muscle cells (see for example Hodgkin, 1951); it is found in general
that a tenfold increase in the external KCl concentration is
accompanied by a decrease in resting potential of about 60 mV at
15-20°C. Since the concentration of potassium ion in the usual
media for such cells is much less than that of chloride ion, the
principal anion, we may neglect the change in chloride ion
concentration, and write

$$t_K' = \frac{E_R^{I} - E_R^{II}}{RT \ln (c^{II}/c^{I})} = \frac{0.060}{8.3 \times 300 \times 2.3} = 1.035 \times 10^{-5} \tag{56}$$

or, from (34),

$$t_K = t_K' z_K = 1.035 \times 10^{-5}\, F = 1.00 \tag{57}$$
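
The arithmetic behind this estimate is easy to reproduce. The sketch
below assumes a 60 mV change per tenfold change in concentration, the
rounded constants $R = 8.3$ and $T = 300$ used in the text, and a faraday
of 96,500 coulombs per mole; the function name and the faraday value
are assumptions made for illustration. With the natural logarithm of
10 taken exactly, the result comes out near $1.05 \times 10^{-5}$, close
to the figure quoted in the text, and the transference number is close
to unity either way.

```python
import math

FARADAY = 96500.0  # coulombs per mole (assumed value)

def transference_per_charge(delta_e_volts, conc_ratio, R=8.3, T=300.0):
    """Change in resting potential divided by RT * ln(concentration ratio)."""
    return delta_e_volts / (R * T * math.log(conc_ratio))

# 60 mV decrease for a tenfold increase in external potassium chloride.
t_prime_K = transference_per_charge(0.060, 10.0)
t_K = t_prime_K * FARADAY   # univalent ion: valence times the faraday

if __name__ == "__main__":
    print(f"t'_K = {t_prime_K:.3e}")
    print(f"t_K  = {t_K:.2f}")
```

A value of $t_K$ near 1 is what supports the conclusion that potassium
carries essentially all of the passive current.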


From this we must conclude that most or all of any passive current
flowing across the resting nerve membrane is carried by potassium
ion. We may then rewrite (50) approximately for the nerve axon
membrane, where the sodium ion is known to be actively transported,

$$E_R = -t_K' \Delta\mu_K - \frac{z_{Na} J_{Na}^A}{L_E}. \tag{58}$$

Or, if we follow Hodgkin and Huxley (1952), and use the potassium
equilibrium potential $E_{K^+}$ defined as

$$E_{K^+} = \frac{RT}{z_K} \ln \frac{(K^+)_x}{(K^+)_0} = -\frac{\Delta\mu_K}{z_K}, \tag{59}$$

we have for (50),

$$E_R = E_{K^+} - \frac{z_{Na} J_{Na}^A}{L_E}. \tag{60}$$

This means that the resting membrane potential of an axon may be
thought of as originating from two opposing emf's: a potassium
equilibrium potential, and an active transport potential for sodium
ion. Hodgkin and Huxley (1952) have clearly demonstrated that
the excitation of the axonal membrane is associated with a sudden
influx of sodium ion; it seems reasonable to suggest that this
influx could result from a sudden inactivation of the sodium transport
mechanism, rather than from the increase in "sodium permeability"
which they postulate.


The current-voltage relation: With certain systems such as frog 
skin (Ussing and Zerahn, 1951), the frog large intestine (Cooper- 
stein and Hogben, 1959), the toad bladder (Leaf, Anderson and 
Page, 1957), and certain large plant cells (Blinks, 1935), it is 
possible to make the chemical potentials on the two sides of the 
system identical, and the hydrostatic pressure difference zero, 
when (43) becomes 


I=Ly E+) a,JA (61) 
j 

This equation permits the determination of L,; and of S2,J4 from 
asimple //E plot, if the relation is linear. Pumphrey (1934), using 
frog skin, found a linear relation for the region in which the skin 
potential is negative in our convention (positive in his); we have 
confirmed his results. Ussing and Zerahn (1951) have already 
shown, in their classical ‘‘short-circuit’’ experiment, that the 
current across frog skin at zero potential, given by 


$$I = \sum_j z_j J_j^A \qquad (62)$$


is equivalent to the flux of sodium ion; Leaf, Anderson and Page 
(1957) have recently shown the same thing for the toad bladder, as 
have Cooperstein and Hogben (1959) for the frog large intestine. 
Equation (61) therefore provides us with a valuable tool for the
study of the properties of membranes in which active transport is
occurring, since it permits us to distinguish unequivocally between
the effects of any experimental treatment on permeability
($L_E$) and on the active flux ($J_j^A$).
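The use of the linear current-voltage relation can be illustrated by a minimal sketch, assuming an ordinary least-squares fit of current against potential. Under equation (61) the slope estimates the coefficient of E and the intercept estimates the summed active-flux term. The function name `fit_current_voltage` and the data values are hypothetical and are not taken from any of the experiments cited.

```python
def fit_current_voltage(potentials, currents):
    """Fit currents = slope * potentials + intercept by least squares."""
    n = len(potentials)
    mean_e = sum(potentials) / n
    mean_i = sum(currents) / n
    sxx = sum((e - mean_e) ** 2 for e in potentials)
    sxy = sum((e - mean_e) * (i - mean_i)
              for e, i in zip(potentials, currents))
    slope = sxy / sxx                      # estimate of the coefficient of E
    intercept = mean_i - slope * mean_e    # estimate of the current at E = 0
    return slope, intercept

# Hypothetical, exactly linear data (arbitrary units).
potentials = [-0.04, -0.02, 0.0, 0.02]
currents = [2.0 * e + 0.05 for e in potentials]
slope, intercept = fit_current_voltage(potentials, currents)
```

An experimental treatment that changes only the slope would then be read as an effect on permeability, and one that changes only the intercept as an effect on the active flux.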


Summary 


The model used in a previous paper (Scheer, 1958) to develop a 
definition and experimental criteria for active transport is used to 
explore the relations of fluxes and forces across complex mem- 
branes with active transport. The results are considered to be of 
sufficient generality to include most biological systems in which 
active transport occurs. The two basic equations (42, 43) give,
respectively, mechanical flux and electrical current flux as simple 
linear functions of differences in electrical potential, hydrostatic 
pressure and chemical potential across the membrane, and the 
active flux. The equations can be further simplified by experi- 
mental situations in which one or more of the variables is held at 
zero to permit determination of the parameters of the equations. 
The equations are identical in the absence of active transport 
with those derived by Staverman (1952) for a simple membrane, and 
these in turn are of the same form as, but more general than, 
classical electrochemical equations. The available experimental 
evidence in the form of current-potential curves and the relation of 
potential changes to concentration changes for actual membranes 
is described by the relevant equations. 

This study was supported by a grant (G8619) from the National 
Science Foundation. The author is again indebted to Professor 
Pierre Van Rysselberghe for his interest and for valuable suggestions
in the course of the work; Dr. H. D. Landahl has also
made helpful suggestions.
LITERATURE 


Blinks, L. R. 1935. ‘‘Protoplasmic Potentials in Halicystis. IV. Vacuolar
Perfusion with Artificial Sap and Sea Water.’’ J. Gen. Physiol., 18,
409-420.

Boyd, I. A. and Martin, A. R. 1959. ‘‘Membrane Constants of Mammalian
Muscle Fibers.’’ J. Physiol., 147, 450-457.

Cooperstein, I. L. and Hogben, C. A. M. 1959. ‘‘Ionic Transfer across
the Isolated Frog Large Intestine.’’ J. Gen. Physiol., 42, 461-473.

Frank, K.; Fuortes, M. G. F. and Nelson, P. G. 1959. ‘‘Voltage Clamp
of Motoneuron Soma.’’ Science, 130, 38-39.

Hagiwara, S. and Saito, N. 1959. ‘‘Membrane Potential Change and
Membrane Current in the Supramedullary Nerve Cell of the Puffer.’’
J. Neurophysiol., 22, 204-221.

Hodgkin, A. L. 1951. ‘‘The Ionic Basis of Electrical Activity in Nerve
and Muscle.’’ Biol. Rev., 26, 339-410.

Hodgkin, A. L. and Huxley, A. F. 1952. ‘‘A Quantitative Description of
Membrane Current and its Application to Conduction and Excitation in
Nerve.’’ J. Physiol., 117, 500-544.

Hodgkin, A. L.; Huxley, A. F., and Katz, B. 1952. ‘‘Measurement of
Current-voltage Relations in the Membrane of the Giant Axon of Loligo.’’
J. Physiol., 116, 424-448.

Koblick, D. C. 1959. ‘‘An Enzymatic Ion-Exchange Model for Active
Sodium Transport.’’ J. Gen. Physiol., 42, 635-645.

Leaf, A.; Anderson, J., and Page, L. B. 1957. ‘‘Active Sodium Transport
by the Isolated Toad Bladder.’’ J. Gen. Physiol., 41, 657-668.

Miller, D. G. 1960. ‘‘Thermodynamics of Irreversible Processes. The
Experimental Verification of the Onsager Reciprocal Relations.’’
Chem. Rev., 60, 15-37.

Prigogine, I. 1955. Introduction to the Thermodynamics of Irreversible
Processes. Springfield.

Pumphrey, R. J. 1934. ‘‘The Electrical Properties of Isolated Frog
Skin.’’ J. Exp. Biol., 11, 423-432.

Scheer, B. T. 1958. ‘‘Active Transport: Definitions and Criteria.’’
Bull. Math. Biophys., 20, 231-244.

Staverman, A. 1952. ‘‘Non-equilibrium Thermodynamics of Membrane
Processes.’’ Trans. Faraday Soc., 48, 176-185.

Ussing, H. H. and Zerahn, K. 1951. ‘‘Active Transport of Sodium as the
Source of Electric Current in the Short-Circuited Isolated Frog Skin.’’
Acta Physiol. Scand., 23, 110-127.

RECEIVED 2-8-60 


MOLECULAR SET THEORY: A MATHEMATICAL REPRESENTA- 
TION FOR CHEMICAL REACTION MECHANISMS 


ANTHONY F. BARTHOLOMAY* 
HARVARD UNIVERSITY 
BOSTON, MASSACHUSETTS 


The present investigation is part of a program aimed at examining the 
mathematical basis of reaction rate theory (Bartholomay, Bull. Math.
Biophys. 20, 1958) from the point of view of individual molecular events.
Certain inconsistencies resulting from the classical procedure of using 
stoichiometric ‘‘chemical equations’’ to represent the kinds of mecha- 
nisms involved are pointed out and remedies are suggested by the intro- 
duction of set-theoretic notation within the framework of a so-called 
Molecular Set Theory. It is concluded that whereas ordinary algebra is a 
suitable basis for stoichiometry, in discussing mechanisms of chemical 
reactions, molecular set theory is more appropriate. This is a theory of 
material transformations which is patterned after abstract set theory: the 
individual molecules of a chemical species and chemical transformations 
from one species to another correspond to the abstract concepts of ele- 
ments or points of a set and their mappings into other sets respectively. 
The new notions of exact molecular sequences and equivalence classes 
of molecules arising from this study of chemical reactions may be of 
purely mathematical interest when referred back to the context of ab- 
stract set theory. 


Stoichiometric chemical equations are the symbolic expressions 
of the identity of total weight and atomic composition which have 
been shown to exist between reactant chemical species and their 
chemical transforms or products of reaction. The present investigation
was undertaken primarily to examine the mathematical content
of such ‘‘equations’’ as $A + B \to C$, which suggest a mixture
of algebraic symbolism (the plus sign) and transformation theory 
(the arrow). From this has resulted the designation of a new 


*The Biophysics Research Laboratory of the Department of Medicine, 
Harvard Medical School and The Peter Bent Brigham Hospital. 




mathematical context and symbolism for representing the mecha- 
nisms of chemical reactions. 

Of mathematical interest is the specialized set theory which de- 
velops from this application. The combination of set-theoretic no- 
tions, together with the superimposed chemical and probabilistic 
structure as determining factors of the individual molecular trans- 
formations, forms a molecular set theory, or theory of sets of 
molecules undergoing chemical transformation. It represents an 
extension of the notion of ordinary abstract mathematical trans- 
formation to include material, experimentally determined trans- 
formations. 

This approach is of particular interest at the present time in the 
light of the work of Rashevsky (1954-1959) and Rosen (1958, 
1959) toward the development of a topological representation 
theory for biological systems. 

1. The stoichiometric equation. Consider, for example, a ‘‘bal- 
anced equation’’ between hydrogen and oxygen gases and water: 


$$2H_2 + O_2 = 2H_2O \qquad (1)$$


Interpreting this as an ordinary algebraic equality, if $H_2$ represents
two atomic weights of hydrogen = 2.016; $O_2$, two atomic weights
of oxygen = 32; and $H_2O$ the molecular weight of water = 18.016,
then substituting these values in (1) gives 2 (2.016) + 32 = 2
(18.016), an unconditional equality or mathematical identity.
Furthermore, given the actual weight relations of the reaction mixture
and considering the symbols $H_2$, $O_2$ and $H_2O$ in (1) as algebraic
variables, the resulting weight of the product, water, may be
obtained by solving this as an ordinary algebraic equation. Similar 
algebraic considerations apply if one considers that on the left of 
equation (1) there are four atoms of hydrogen to every two atoms of 
oxygen, which must equal the atomic constitution of the products 
appearing on the right-hand side. 
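The two algebraic readings of equation (1), equality of total weight and equality of atomic constitution, can be checked mechanically. The following is a minimal sketch, assuming the atomic weights quoted above (hydrogen 1.008, oxygen 16); the helper names `molecular_weight` and `side_totals` are illustrative only.

```python
ATOMIC_WEIGHT = {"H": 1.008, "O": 16.0}  # values implied by the text

def molecular_weight(formula):
    """formula maps element symbol -> number of atoms in one molecule."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in formula.items())

def side_totals(side):
    """side is a list of (coefficient, formula); returns (weight, atom counts)."""
    weight = sum(coef * molecular_weight(f) for coef, f in side)
    atoms = {}
    for coef, f in side:
        for el, n in f.items():
            atoms[el] = atoms.get(el, 0) + coef * n
    return weight, atoms

H2, O2, H2O = {"H": 2}, {"O": 2}, {"H": 2, "O": 1}
left_weight, left_atoms = side_totals([(2, H2), (1, O2)])
right_weight, right_atoms = side_totals([(2, H2O)])
```

Both sides carry four hydrogen atoms and two oxygen atoms, and the two weights agree to within floating-point rounding.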

If the same notation is used to express the mechanism of reac- 
tion, and if in this case, the complete mechanism is assumed to be 
of an elementary type consisting of the direct conversion of hydro- 
gen and oxygen gas to water, with no intermediate steps, then this 
would be indicated in the customary fashion by writing: 


$$2H_2 + O_2 \to 2H_2O \qquad (2)$$




the arrow indicating that the reaction proceeds from the reactant 
chemical species on the left to the product chemical species on 
the right. In the present paper the notion of the new kind of mathe- 
matical mapping or set transformation expressed by the arrow will 
be formalized. It will be combined with the conception of the 
chemical species involved, as material, or non-abstract sets of 
molecules, and made subject to the laws of composition similar to 
those which govern ordinary set-theoretic transformations. The 
result of applying this theory to the mechanism implied by equation 
(2), for example, is the representation: 


$$T: (H_2)^2 \otimes O_2 \to (H_2O)^2 \qquad (3)$$


A set of rules for such representations will be defined by examples 
of various basic mechanisms of reaction. 

All of the mechanisms represented in this paper are assumed to 
be nondecomposable into more basic units. They will be taken to 
represent reactions which are explainable as elementary collision 
processes. Thus, for example, when the unimolecular reaction 
process is studied it is considered either in an isolated sense as 
a complete mechanism or potentially as a step in a more complex 
mechanism. The Lindemann-Hinshelwood (See Lindemann 1922, 
Hinshelwood 1927) scheme, for example, which has been used to 
explain unimolecular decompositions in terms of collisions be- 
tween molecules of the same species is not assumed when we 
write $A \to B$. The latter symbolism applies only to a complete
elementary mechanism. It is, in fact, but one step in the Hinshel- 
wood-Lindemann Theory. 

It is also understood that all reactant and product species are 
imbedded in an ideal space of constant volume V in which all 
parameters affecting reaction are held fixed. This assumption will
be needed particularly in section 3 where the connection between 
the suggested representation and kinetic analysis of reactions will 
be discussed. 

2. The Point-set-Theoretic Representations of Reaction Mechanisms.

Case 1. The Unimolecular irreversible transformation. 

The usual representation for the simplest conceivable unimolecular
reaction is: $A \to B$, where A is the single reactant species
and B the single product species. The symbolism indicates that 
the reaction mechanism consists of the chemical transformation of 




each molecule of A into a form which is identified as a molecule 
of a species B. 

Consider A in a new mathematical sense as a ‘‘molecular set’’
consisting of a number $\nu_0$ of similar molecular elements, the
molecules of reactant species A; and B, initially, as a skeleton
set to be filled by $\nu_0$ similar molecules belonging to chemical
species B. In this context the chemical transformation mechanism
is represented as

$$T: A \to B \qquad (4)$$


which, in the mathematical terms of molecular set theory connotes 
a transformation T of the elements of the domain set A onto the 
elements of the range set B. The value of the transformation (4) 
applied to an individual element $a \in A$ is denoted in the usual
mathematical fashion by

$$T(a) = b \qquad (4')$$

where $b \in B$.

Unless otherwise indicated, the symbolism T, as in (4), will refer
to the chemical transformation itself. It has a real, operational
character in contrast to the analogous mathematical abstraction.
It characterizes a dynamic sequence of chemical and chance
events which culminates eventually in the assignment of a definite
element $b \in B$ to a given element $a \in A$ (see section 3 on kinetic
considerations). Once the reaction is complete and this assignment 
has been made, transformation (4) of course can be understood in a 
more purely mathematical sense, simply as an ordinary mapping on 
A to B. 

Subsequent cases will demonstrate further the nature of the difference
between chemical or material transformations T and their
abstract counterparts. There is not enough complexity in this 
simplest unimolecular case to emphasize this, for the elements of 
B are simply alterations of the corresponding elements of A. In 
other words, if initially all the elements of A are ordered arbitrarily
as $a_1, a_2, \ldots, a_{\nu_0}$, then

$$T(a_i) = b_i \quad (i = 1, 2, \ldots, \nu_0) \qquad (5)$$

signifies that as a result of T, $a_i$ retains its atomic constitution
but changes its geometrical configuration to the extent of being
identified as an element $b_i$ of B. The resulting mathematical


correspondence connoted by (4) is thus a very simple 1-1 corre- 
spondence; and the chemical transformation itself is completely 
determinable a priori, the only indeterminate factor being the time 
relationships of individual transformations (see section 3). In 
subsequent cases, both the simplicity of the mathematical associ- 
ation of elements of the reactant and product species and the com- 
plete determinism of the transformation will be seen to be lacking 
a priori.

Case 2. The bimolecular transformation

$$A_1 + A_2 \to B_1 + B_2 + B_3 \qquad (6)$$


In this elementary process it is assumed that a molecule of reactant
species $A_1$ combines with a molecule of reactant species
$A_2$ to produce three new molecules, one each of product species
$B_1$, $B_2$ and $B_3$. This kind of mechanism occurs frequently in
chemistry. The designation of three product species components
is arbitrary.

In this case the molecules in the reaction mixture molecular set
are elements of two independent sets $A_1$ and $A_2$. Reaction occurs
when an element of $A_1$ is effectively paired with an element of $A_2$.
The molecular set-theoretic expression for this pairing is borrowed
directly from the abstract mathematical expression for the Cartesian
Product of two sets, $A_1$ and $A_2$. It is denoted by
$A_1 \otimes A_2$, the set of all possible (a priori) pairs of molecules
$(a_1, a_2)$ where $a_1 \in A_1$ and $a_2 \in A_2$. Similarly the space of products
of the reaction may be represented by the Cartesian Product
$[B_1 \otimes B_2 \otimes B_3]$ denoting the triplets $(b_1, b_2, b_3)$ of three different
kinds of molecules produced by the effective collisions of the
pairs $(a_1, a_2)$. Thus, the reaction mechanism is represented mathematically
as the molecular set transformation:


$$T: A_1 \otimes A_2 \to B_1 \otimes B_2 \otimes B_3 \qquad (7)$$

where individual values of the transformation are expressed as:

$$T(a_1, a_2) = (b_1, b_2, b_3) \qquad (8)$$

and where it is understood that $T(\ ,\ )$ has the value zero
if either an $a_1$ or an $a_2$ is missing, so that the reaction stops when
either the set $A_1$ or $A_2$ is exhausted of all its elements. In other
words, if $\nu_{10}$ = initial number of molecules of $A_1$ present in V and
$\nu_{20}$ = initial number of elements of $A_2$ present, then $\min(\nu_{10}, \nu_{20})$
determines the ‘‘size’’ or ‘‘rank’’ of the set of transformation
values; or, stating this in another way, the rank of each molecular
set $B_i$ (i = 1, 2, 3) at the completion of the reaction will be
$\min(\nu_{10}, \nu_{20})$.

Again the product space $B = B_1 \otimes B_2 \otimes B_3$ is determined experimentally,
the laws of chemical combination together with the
laws of chance in control of random collisions cooperating to bring
arbitrary molecules of $A_1$ and $A_2$ into effective collision. In other
words, assuming $\nu_{10} \le \nu_{20}$ and arbitrary orderings of the elements
of $A_1$ and $A_2$, a particular molecule $a_1^{(j)} \in A_1$ will eventually be
paired effectively with some molecule $a_2^{(k_j)} \in A_2$ to give rise to a
triplet $(b_1^{(j)}, b_2^{(j)}, b_3^{(j)})$ of molecules of $B_1 \otimes B_2 \otimes B_3$, in
which case $T(a_1^{(j)}, a_2^{(k)})$ could be defined as zero when $k \ne k_j$ as
well as when $a_1^{(j)} = 0$. However, a priori there are only probable
pairings, the final pairings being settled on an experimental basis
as the transformation T runs its course.
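The chance character of the final pairings can be imitated numerically. The following is a minimal sketch, assuming that every unreacted molecule is equally likely to take part in the next effective collision (an assumption made here only for illustration); the function name `bimolecular_pairing` and the labels are hypothetical.

```python
import random

def bimolecular_pairing(n1, n2, seed=0):
    """Pair molecules of A1 and A2 at random until one set is exhausted."""
    rng = random.Random(seed)
    a1 = [("a1", j) for j in range(1, n1 + 1)]
    a2 = [("a2", k) for k in range(1, n2 + 1)]
    rng.shuffle(a1)
    rng.shuffle(a2)
    pairs = list(zip(a1, a2))  # zip stops when the smaller set runs out
    # One product triplet (b1, b2, b3) per effective pair.
    triplets = [(("b1", r), ("b2", r), ("b3", r))
                for r in range(1, len(pairs) + 1)]
    unreacted = a1[len(pairs):] + a2[len(pairs):]
    return pairs, triplets, unreacted

pairs, triplets, unreacted = bimolecular_pairing(3, 5)
```

Different seeds give different pairings, but the number of triplets is always the smaller of the two initial counts, and the molecules left over all belong to the species present in excess.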

In the purely mathematical analogous case, for example, an
ordinary set mapping might be specified in advance by ordering
arbitrarily the elements of two abstract sets $A_1$ and $A_2$ and then
giving as the defining rule for T the association $T(a_1^{(i)}, a_2^{(j)}) =
(b_1^{(i)}, b_2^{(i)}, b_3^{(i)})$. If, say, $\nu_{10} = 2$, $\nu_{20} = 3$, then the three pairs
$(a_1^{(1)}, a_2^{(1)})$, $(a_1^{(1)}, a_2^{(2)})$, $(a_1^{(1)}, a_2^{(3)})$ would all correspond to
$(b_1^{(1)}, b_2^{(1)}, b_3^{(1)})$; and the three pairs $(a_1^{(2)}, a_2^{(1)})$, $(a_1^{(2)}, a_2^{(2)})$,
$(a_1^{(2)}, a_2^{(3)})$, all to $(b_1^{(2)}, b_2^{(2)}, b_3^{(2)})$. The mapping would be
from $A_1 \otimes A_2$ onto $B_1 \otimes B_2 \otimes B_3$.
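The abstract mapping of this example can be written out in full as a sketch, assuming sets of two and three elements as in the text and taking the defining rule to depend only on the index of the first element; the string labels are hypothetical stand-ins for the superscripted symbols.

```python
from itertools import product

n1, n2 = 2, 3
A1 = [f"a1({i})" for i in range(1, n1 + 1)]
A2 = [f"a2({j})" for j in range(1, n2 + 1)]

def T(a1, a2):
    """Defining rule fixed in advance: the triplet depends only on a1."""
    i = A1.index(a1) + 1
    return (f"b1({i})", f"b2({i})", f"b3({i})")

# Every element of the Cartesian product is mapped.
mapping = {pair: T(*pair) for pair in product(A1, A2)}
image = set(mapping.values())
preimage_sizes = {b: sum(1 for v in mapping.values() if v == b) for b in image}
```

All six pairs have an image, and each of the two triplets is the image of three pairs, in contrast to the chemical case, where only as many pairs as there are molecules of the scarcer species are ever realized.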

In the chemical case, on the other hand, when $\nu_{10} < \nu_{20}$ there
will always exist some subset of $A_1 \otimes A_2$, say $A_1 \otimes A_2'$ (where
$A_2' \subset A_2$; i.e., the molecules of $A_2'$ are chosen from those of $A_2$)
such that the transformation will establish a one-to-one correspondence
between it and the skeleton space $B_1 \otimes B_2 \otimes B_3$. If
the null set $\emptyset$ is included in the product space and $T(a_1, a_2^{(j)})$
is set equal to $0 \in \emptyset$ for all remaining $a_2^{(j)}$’s then such a transformation
would be called ‘‘onto.’’ In any event the A-space of
reactants is decomposable into a nontrivial subset which is in 1-1
correspondence with B, and a trivial subset which corresponds to
the null set $\emptyset$.

Case 3. The Multimolecular transformation

$$\alpha_1 A_1 + \alpha_2 A_2 + \ldots + \alpha_m A_m \to \beta_1 B_1 + \beta_2 B_2 + \ldots + \beta_n B_n \qquad (9)$$


In this most general multimolecular reaction mechanism it is assumed
that $\alpha_1$ molecules of reactant species $A_1$, $\alpha_2$ of $A_2$, ..., and
$\alpha_m$ of $A_m$, all collide simultaneously and effectively to produce
$(\beta_1 + \beta_2 + \ldots + \beta_n)$ new molecules: $\beta_1$ molecules of product
species $B_1$, $\beta_2$ of $B_2$, ..., $\beta_n$ of $B_n$. The order of molecularity
here is $(\alpha_1 + \ldots + \alpha_m)$. Instances of anything greater than termolecularity
are rare, for in such cases the mechanism usually will
proceed in successive steps building on mechanisms of low molecularity.
However, to complete the general theory, the mechanism
(9) is considered with any positive integral value of m.
Reaction occurs when simultaneously a combination of $\alpha_1$
elements, say $(a_{11}, a_{12}, \ldots, a_{1\alpha_1})$ of species $A_1$ is joined with
$\alpha_2$ combined elements $(a_{21}, a_{22}, \ldots, a_{2\alpha_2})$ of $A_2$; ...; with $\alpha_m$
combined elements $(a_{m1}, a_{m2}, \ldots, a_{m\alpha_m})$ of $A_m$. Thus, as the reaction
proceeds under the influence of chemical and random forces,
$\alpha_i$ molecules of each reactant species $A_i$ (i = 1, 2, ..., m) are
grouped together in each effective transformation multiplet. At the
end of the reaction each chemical species $A_i$ will have been subdivided
into a ‘‘residual’’ molecular subset $\bar{A}_i$ and a number
$[\nu_{k0}/\alpha_k]$ of subsets, $\{A_i\}_r$, each of size $\alpha_i$, where the symbol [ ]
means ‘‘greatest integer in,’’ and where

$$\left[\frac{\nu_{k0}}{\alpha_k}\right] = \min\left(\left[\frac{\nu_{10}}{\alpha_1}\right], \left[\frac{\nu_{20}}{\alpha_2}\right], \ldots, \left[\frac{\nu_{m0}}{\alpha_m}\right]\right).$$

Introducing $(A_i)^{\alpha_i}$ to denote this decomposition
of the original set of molecules of species $A_i$, this molecular
set decomposition will be symbolized by:

$$(A_i)^{\alpha_i} = \{A_i\}_1 \cup \{A_i\}_2 \cup \ldots \cup \{A_i\}_{[\nu_{k0}/\alpha_k]} \cup \bar{A}_i \quad (i = 1, 2, \ldots, m) \qquad (10)$$


where the symbol ‘‘$\cup$’’ stands for ‘‘set-union’’ and is defined by
direct analogy with ordinary set theory. In (10) it is also assumed
that all components are mutually exclusive with respect to the
others: i.e.,

$$\{A_i\}_r \cap \{A_i\}_s = \emptyset \ \text{(null set)} \qquad (11)$$

$$\{A_i\}_r \cap \bar{A}_i = \emptyset \qquad (12)$$

where $r \ne s$ and $r, s = 1, 2, \ldots, [\nu_{k0}/\alpha_k]$; the notation ‘‘$\cap$’’ stands for
‘‘set-intersection,’’ and again is defined in the obvious manner by
analogy with the common set notation.

If one assumes the ideal homogeneous case in which, theoretically,
(1) all molecules in a given species are morphologically
identical and (2) all of the $\alpha_i$ molecules of the same species $A_i$
in an effective transformation multiplet are interchangeable, then
these $\alpha_i$ molecules form what can be called a ‘‘molecular equivalence
class’’: each $\{A_i\}_r$ ($r = 1, 2, \ldots, [\nu_{k0}/\alpha_k]$) would be such a
class. The ‘‘equivalence relation’’ depends on the structural
properties of the molecular elements as well as on the transformation
T (and those factors which determine T).
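The decomposition of each reactant species into equivalence classes and a residual subset can be sketched as follows. The grouping below simply takes molecules in their arbitrary initial order, whereas in an actual reaction the grouping is settled by chance; the function name `decompose` and the tuple labels are hypothetical.

```python
def decompose(initial_counts, alphas):
    """Split each species into classes of size alpha_i plus a residual subset.

    initial_counts[i] is the initial number of molecules of species i and
    alphas[i] its stoichiometric coefficient. The number of multiplets is
    the smallest 'greatest integer in' count / alpha over all species.
    """
    n_multiplets = min(c // a for c, a in zip(initial_counts, alphas))
    species = []
    for i, (count, alpha) in enumerate(zip(initial_counts, alphas)):
        molecules = [(i, j) for j in range(count)]
        classes = [molecules[r * alpha:(r + 1) * alpha]
                   for r in range(n_multiplets)]
        residual = molecules[n_multiplets * alpha:]
        species.append((classes, residual))
    return n_multiplets, species

# Hypothetical example: 7 molecules with coefficient 2, 5 with coefficient 1.
n_multiplets, species = decompose([7, 5], [2, 1])
```

In this example three multiplets are possible; the first species leaves one residual molecule and the second leaves two, and within each species the classes and the residual are mutually exclusive and together exhaust the original set.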

Similar considerations apply to the B-space, i.e., the set of
product species $B_1, B_2, \ldots, B_n$. Thus, in terms of molecular set
notation reaction (9) is expressible as:

$$T: (A_1)^{\alpha_1} \otimes (A_2)^{\alpha_2} \otimes \ldots \otimes (A_m)^{\alpha_m} \to (B_1)^{\beta_1} \otimes (B_2)^{\beta_2} \otimes \ldots \otimes (B_n)^{\beta_n} \qquad (13)$$

where it is understood that this representation becomes definite,
and the equivalence classes specified, at the completion of the reaction.
It is also understood that the transformation is not defined
over elements of residual sets $\bar{A}_i$. The resulting number of molecules
of species $B_j$ will be $\beta_j [\nu_{k0}/\alpha_k]$ (where j = 1, 2, ..., n).
According to (13) and (10) an individual transformation is defined
by selecting an ‘‘equivalence class’’ $\{A_1\}_r$ of $\alpha_1$
elements of species $A_1$, combining it with an equivalence class
$\{A_2\}_r$ of $\alpha_2$ elements of species $A_2$, etc., to obtain an element of
the product space B: i.e.,

$$T(\{A_1\}_r, \{A_2\}_r, \ldots, \{A_m\}_r) = (\{B_1\}_r, \{B_2\}_r, \ldots, \{B_n\}_r), \qquad (14)$$

(where $r = 1, 2, \ldots, [\nu_{k0}/\alpha_k]$). In this mechanism, then, equivalence
classes or sets of $\alpha_i$ molecules form the individual elements of
the domain space of the transformation.

Comparing this with the bimolecular mechanism, similar points 
can be made on the mathematical correspondence established be- 
tween the reactant A-space and the product B-space. A one-to-one 
correspondence, in the course of reaction, will be established be- 




tween the nontrivial components of each species: the equivalence 
classes of A and the equivalence classes of B. 
Case 4. The chain transformation: (for example:)

$$\alpha_1 A_1 + \ldots + \alpha_m A_m \to \beta_1 B_1 + \ldots + \beta_n B_n \to \gamma_1 C_1 + \ldots + \gamma_p C_p \qquad (15)$$


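Using the molecular set notation of the multimolecular transformation,
the mechanism in this case is composed of two transformations:

$$T_1: (A_1)^{\alpha_1} \otimes \ldots \otimes (A_m)^{\alpha_m} \to (B_1)^{\beta_1} \otimes \ldots \otimes (B_n)^{\beta_n}$$
$$T_2: (B_1)^{\beta_1} \otimes \ldots \otimes (B_n)^{\beta_n} \to (C_1)^{\gamma_1} \otimes \ldots \otimes (C_p)^{\gamma_p} \qquad (16)$$

Representation (15) suggests that there is induced a direct composed
overall transformation $T = T_2 \circ T_1$, by which would be
meant

$$T = T_2 \circ T_1: (A_1)^{\alpha_1} \otimes \ldots \otimes (A_m)^{\alpha_m} \to (C_1)^{\gamma_1} \otimes \ldots \otimes (C_p)^{\gamma_p} \qquad (17)$$

where

$$T(\{A_1\}_r, \ldots, \{A_m\}_r) = T_2(T_1(\{A_1\}_r, \ldots, \{A_m\}_r)) = (\{C_1\}_r, \ldots, \{C_p\}_r) \qquad (18)$$

(where $r = 1, 2, \ldots, [\nu_{k0}/\alpha_k]$).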


However, there is not necessarily a coincidence here of the space
containing the range of $T_1$ with that containing the domain of $T_2$,
for in the course of reaction, space B is substructured into equivalence
classes in one way as a product molecular set of the mapping
$T_1$, and possibly in a different way when it serves as the reactant
molecular set of the mapping $T_2$. The sequential representation
given in (15), is therefore misleading if molecular events are con- 
sidered. From the point of view of Information Theory, there is a 
nonzero probability that at the second stage of a chain reaction 
the information or entropy involved in the chance selection of 
equivalence classes and transformation multiplets might approach 
in magnitude that involved in the first stage. 
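The regrouping of the intermediate species can be illustrated by a small simulation. The following is a sketch only, assuming that the second stage draws its multiplets uniformly at random from the pool of intermediate molecules; the function name `preserved_fraction` is hypothetical.

```python
import random

def preserved_fraction(n_events, beta, seed=0):
    """Fraction of second-stage multiplets that coincide with a first-stage class.

    Each first-stage event r yields `beta` molecules of one intermediate
    species, tagged with r. The second stage regroups the pool at random
    into multiplets of the same size.
    """
    rng = random.Random(seed)
    pool = [r for r in range(n_events) for _ in range(beta)]
    rng.shuffle(pool)
    multiplets = [pool[i:i + beta] for i in range(0, len(pool), beta)]
    # A multiplet is preserved when all its members share one origin tag.
    preserved = sum(1 for m in multiplets if len(set(m)) == 1)
    return preserved / len(multiplets)

unimolecular = preserved_fraction(n_events=50, beta=1)
bimolecular = preserved_fraction(n_events=50, beta=2)
```

With one molecule per class nothing can be mixed and every class is preserved; with two or more molecules per class, random regrouping almost never reproduces the classes of the first stage.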


‘S (19) 


° 




These considerations for a two-step chain reaction extend in 
obvious fashion to chains of any length, a phenomenon which oc- 
curs in radioactivity, for example, where all the stages are uni- 
molecular. In long chain reactions the complication just described 
may be likened to that which characterizes exact sequences of 
groups and homomorphisms (see Hurewicz 1941 and Kelley and 
Pitcher 1947) and of modules and homomorphisms (Cartan and 
Eilenberg 1956). The corresponding definition of exactness in 
chemical chain transformations of molecular sets is as follows. 

Def. Exact sequence of molecular sets and transformations. If
$A_1, A_2, \ldots, A_p$ are molecular sets of any kind related by the transformations
$T_1: A_1 \to A_2$; $T_2: A_2 \to A_3$; ...; $T_{p-1}: A_{p-1} \to A_p$,
the sequence is ‘‘exact’’ when the ‘‘image’’ of $T_k$ coincides with
the ‘‘kernel’’ of $T_{k+1}$, where k = 1, ..., p - 2.

The ‘‘image’’ of a molecular set transformation $T: A \to B$ is the
set of molecules in the product B-space actually generated by the
totality of applications of T on the transformation multiplets in the
reactant A-space; the ‘‘kernel’’ of T is the subset of molecules or
possible reactant multiplets (whichever is appropriate) in A on
which T fails to act in the actual reaction course. Clearly, in
multimolecular reactions involving long chains of successive
transformations, a condition of sequential exactness has a high
probability of establishing itself. If, for example, the sets $A_i$ in
this definition are compound sets such as those which occur in the
domain or range of a multimolecular transformation $T_i$ (in which
case equivalence classes of molecules form the reaction multiplets),
then in going from $A_1$ to $A_2$, and then to $A_3$, etc., the various
equivalence classes which arise in the first stage as images
of repeated applications of $T_1$ on $A_1$ soon become mixed; they are
therefore not transformed to $A_3$, as explained above. The exception
to this rule is the chain of unimolecular transformations, in
which each $A_i$ is a single species (with coefficient 1). In this
‘‘non-exact’’ case, since ultimately each molecule goes into the
form indicated by the subsequent species, the image of $T_k$ is never
equal to the kernel of $T_{k+1}$ (which is always null). Thus, in this
special case only the usual representation (15) used in the chemical
literature leads to no contradictions. But in all other cases
the cautions indicated here apply; the non-sequential representation
given in (16) and its obvious extensions to longer chains are
therefore advanced as being more interpretive of the molecular
events of reaction.
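The definitions of image, kernel and exactness can be stated compactly for finite sets. The following is a minimal sketch, assuming each transformation is recorded after the fact as a list of (reactant, product) pairs; the function names and labels are hypothetical.

```python
def image(pairs):
    """Products actually generated; pairs is a list of (reactant, product)."""
    return {prod for _, prod in pairs}

def kernel(domain, pairs):
    """Elements of the domain on which the transformation failed to act."""
    return set(domain) - {reactant for reactant, _ in pairs}

def is_exact(sets, maps):
    """maps[k] records the transformation from sets[k] to sets[k + 1]."""
    return all(image(maps[k]) == kernel(sets[k + 1], maps[k + 1])
               for k in range(len(maps) - 1))

sets = [{"a1", "a2"}, {"b1", "b2"}, {"c1", "c2"}]
# Unimolecular chain run to completion: every molecule is passed on.
complete_chain = [[("a1", "b1"), ("a2", "b2")], [("b1", "c1"), ("b2", "c2")]]
# A chain whose second stage acts on nothing generated by the first.
stalled_chain = [[("a1", "b1"), ("a2", "b2")], []]

complete_is_exact = is_exact(sets, complete_chain)
stalled_is_exact = is_exact(sets, stalled_chain)
```

The completed unimolecular chain is not exact, since the kernel of the second transformation is null while the image of the first is not; the stalled chain satisfies the definition.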




Case 5. The reversible unimolecular transformation $A \rightleftharpoons B$. If
in the unimolecular version of the previous case one substitutes 
for the C-space the original single reactant species A again, one 
obtains this case. However, because of the cyclic nature of the 
mechanism, certain special considerations enter into this case. 

The ‘‘overall reaction’’ is of indeterminate length in this case. 
On the other hand, certain properties of the ‘‘composed’’ trans- 
formations may be anticipated. Thus if one uses the molecular 
set representation suggested in the previous case, the reversible 
unimolecular reaction will be written: 


$$T_1: A \to B \qquad (20)$$
$$T_2: B \to A \qquad (21)$$

and $T_2$ will be called the ‘‘inverse’’ of $T_1$. Moreover, if the following
relations are valid (which seems likely in isomeric transformations
of this kind):

$$T_2(T_1(a)) = T_2(b) = a \quad (\text{for all } a \in A) \qquad (22)$$
$$T_1(T_2(b)) = T_1(a) = b \quad (\text{for all } b \in B), \qquad (23)$$

then $T_2$ could be written $T_1^{-1}$ since $T_2(B)$ would coincide with the
$T_1^{-1}(B)$, the ‘‘pre-image of B’’ under the transformation $T_1$. Moreover
in such a case $T_2 \circ T_1 = T_1 \circ T_2 = I$, the ‘‘identity transformation.’’
Thus a sequence of alternating $T_1$’s and $T_2$’s always results
in the same a or b, as the case may be, at any stage in the
sequence of transformations. (See section 4 for the discussion of 
this and the following cases in broader terms). 
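The behavior of the two transformations as mutual inverses can be checked on a small finite example. This is a sketch only, assuming a one-to-one assignment of three labeled molecules; the labels and the helper names `compose` and `alternate` are hypothetical.

```python
# Forward isomerization and its inverse, as finite one-to-one mappings.
T1 = {"a1": "b1", "a2": "b2", "a3": "b3"}
T2 = {b: a for a, b in T1.items()}

def compose(outer, inner):
    """Return the mapping x -> outer(inner(x))."""
    return {x: outer[inner[x]] for x in inner}

identity_on_A = compose(T2, T1)
identity_on_B = compose(T1, T2)

def alternate(a, steps):
    """Apply T1, T2, T1, ... for the given number of steps, starting from a."""
    x = a
    for step in range(steps):
        x = T1[x] if step % 2 == 0 else T2[x]
    return x
```

After any even number of alternating steps a molecule is back in its original form, and after any odd number it is in the same corresponding form of B.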

Case 6. The multimolecular ‘‘reversible’’ transformation, as for
example

$$A_1 + A_2 \rightleftharpoons B_1 + B_2 + B_3 \qquad (24)$$

is not very likely to be found ‘‘reversible’’ in the same sense as
the preceding special case. This stems from the fact that such a
mechanism is prone to the same kind of randomness which was
discussed in case 4 preceding.
Here, once again, it would be more proper to represent the total
mechanism by two independent transformation expressions:

$$T_1: A_1 \otimes A_2 \to B_1 \otimes B_2 \otimes B_3 \qquad (25)$$
$$T_2: B_1 \otimes B_2 \otimes B_3 \to A_1 \otimes A_2 \qquad (26)$$




The complicated ‘‘genetics’’ of this case are well illustrated in 
the following diagram: 


Thus, only under most exceptional circumstances would the follow- 
ing hold: 


$$T_2(T_1(a_1, a_2)) = T_2(b_1, b_2, b_3) = T_1^{-1}(b_1, b_2, b_3) = (a_1, a_2) \qquad (27)$$


Case 7. Parallel Unimolecular Reaction Transformations:

    A → B_1                  ↗ B_1
    A → B_2       or       A → B_2        (28)
    A → B_3                  ↘ B_3

An example of this kind of mechanism would be the isomerization
of α-pinene (see Frost and Pearson (1948), p. 318). The language
of set theory provides an appropriate mathematical representation
of this mechanism also; viz.,

$$T: A \to B_1 \cup B_2 \cup B_3 \qquad (29)$$

In this transformation, the range or space of products is the molecular
set-theoretical union of three mutually exclusive subsets
$B_1$, $B_2$ and $B_3$. This is useful in signifying that the result of
transformation T applied to an element of A will be:

$$T(a) = b_1 \text{ or } b_2 \text{ or } b_3. \qquad (30)$$




Clearly in this case the ultimate size or rank of each of these subsets
will be determined by a combination of chemical and random
effects. The total size of all of the sets $B_1$, $B_2$, $B_3$ will equal
the initial size of A.
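The way in which chance settles the final sizes of the product subsets can be imitated as follows. This is a minimal sketch, assuming each molecule chooses its branch independently with fixed relative weights; the weights stand in for the chemical factors and are hypothetical, as is the function name `parallel_branching`.

```python
import random

def parallel_branching(n_molecules, weights, seed=0):
    """Assign each molecule of A to one product subset at random."""
    rng = random.Random(seed)
    sizes = [0] * len(weights)
    for _ in range(n_molecules):
        # random.choices returns a list; take its single element.
        branch = rng.choices(range(len(weights)), weights=weights)[0]
        sizes[branch] += 1
    return sizes

sizes = parallel_branching(n_molecules=100, weights=[0.5, 0.3, 0.2])
```

The individual sizes vary from one run (seed) to another, but their total always equals the initial size of A.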

Case 8. Two Parallel Unimolecular transformations producing a 
common product. Consider a complex reaction of the type which is 
commonly represented by: 


    A_1 → B                A_1 ↘
                  or             B        (31)
    A_2 → B                A_2 ↗

This kind of mechanism has been found to operate in the hydroly- 
sis of mixed tertiary aliphatic chlorides (Brown and Fletcher 
1949). It also arises commonly in radioactivity when radiation 
from mixed radioactive substances is being observed. 


The appropriate molecular set-theoretic representation in this 
case is 


$$T: A_1 \cup A_2 \to B \qquad (32)$$


where the reactant A is composed of two mutually exclusive components
$A_1$ and $A_2$ and where a transformation T has a choice between
either set. Eventually all possible elements of each component
become transformed into B so that the ultimate size of B is
the sum of the sizes of $A_1$ and $A_2$.

The representations of more complicated mechanisms constructed 
of combined elementary transformations can be synthesized from 
the transformations given in the preceding cases, using the same 
point set-theoretic language. 

3. Kinetic Considerations. Since in molecular set theory one is 
dealing always with a dynamic situation, the transformation T is 
also a function of time. For simplicity of representation explicit 
mention of the time factor in the proposed molecular set-theoretic 
representations has not been made. However, for purposes of 
examining the ‘‘kinetics’’ of the reaction process the more complete
representation should be used. Considering, now, the simplest
unimolecular decomposition of case 1, the augmented representation
would therefore be:

$$T: A \otimes I \to B \otimes I, \qquad (33)$$

where now I stands for the time interval $[0 < t < T]$, in which T, the
total reaction time may, of course, be infinite, theoretically. This 




transformation indicates that at time $t_i$, a given molecule $a_i \in A$
(assuming in advance an arbitrary ordering of the molecules of A)
is transformed into a new molecule $b_i \in B$, identified with the
same time $t_i$.

Neglecting, for the moment, the probability aspects, and adopt- 
ing the classical, deterministic point of view, the time progress of 
this reaction is described by the continuous time function 


$$\nu = \nu_0 e^{-kt} \qquad (34)$$

where k is the rate constant; $\nu$, the number of molecules of A
present at time t; and $\nu_0$, the initial size or rank of set A. This
time relationship (34) can be used conceptually to determine the
pairings of molecules with corresponding time units as in the following
diagram, which has been drawn for very small $\nu_0$ for purposes
of making the illustration clear:


[FIGURE 2: upper and lower panels on a common time axis t, with tick marks from $t_0$ to $t_{12}$]


The upper graph shows the times $t_1, t_2, \ldots$, at which successive
individual transformations take place. These times have been
transferred down to the lower diagram in the form of ‘‘unit impulses’’
corresponding to the values of a characteristic function,
$\chi$, i.e., a function having the value 1 for transformation, and 0
otherwise, at the appropriate times. According to the deterministic
theory, the spacing of these times is governed by the continuous
function in the upper part of the diagram. Thus, in terms of the
Cartesian pairing notation, the arbitrarily ordered element number
$i$, i.e., $a_i \in A$, may be paired in actual fact with the time $t_j$ $(j \neq i)$.
Therefore, the element $(a_i, t_j)$ of $A \otimes I$ can be rewritten $(a_j, t_j)$
by reordering the elements $a$. The whole pair can now be replaced
by the single notation ‘‘$a_j$.’’ In such a case the subscript will
carry the time ordering of the particular transformation. The same
principle holds for $b_j = T(a_j)$.
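
The pairing of molecules with times described above can be sketched
numerically. The following Python fragment is an illustration only. It
assumes that the n-th transformation is assigned to the instant at which
the continuous curve (34) has fallen by n whole molecules, which is one
reading of the diagram and not a rule stated in the text; the function
names and the values v0 = 12, k = 1 are hypothetical.

```python
import math


def deterministic_times(v0, k):
    """Instants t_n at which v0*exp(-k*t) has dropped by n whole molecules.

    Solving v0*exp(-k*t_n) = v0 - n gives t_n = ln(v0 / (v0 - n)) / k.
    The last molecule (n = v0) is left out because the curve reaches
    zero only as t tends to infinity.
    """
    return [math.log(v0 / (v0 - n)) / k for n in range(1, v0)]


def chi(t, times, tol=1e-12):
    """Characteristic function: 1 at a transformation time, 0 otherwise."""
    return 1 if any(abs(t - tn) <= tol for tn in times) else 0


# Hypothetical illustration values, not taken from the paper.
times = deterministic_times(v0=12, k=1.0)
gaps = [later - earlier for earlier, later in zip(times, times[1:])]
```

Under this assumption the gaps between successive unit impulses widen
as the set A is depleted, and the pattern is the same in every
repetition, which is the deterministic regularity the text refers to.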

It is interesting in terms of this diagram to call attention to the 
difference between the deterministic theory, exemplified by the 
lower diagram of Figure 2, and the more recent stochastic theory 
(Bartholomay, 1958, 1959). The time sequence of individual trans- 
formations recorded as shown is expected according to the de- 
terministic theory to have, intrinsically, always this same spacing 
along the $t$ axis (molecular identifications being disregarded).
However, according to the stochastic theory no such complete 
regularity of pattern is predicted. The particular pattern shown 
represents simply an average picture which would emerge from 
many repetitions of the same chemical process under identical 
conditions. In other words, in an actual case, according to the 
stochastic theory, the spacing between successive $t$'s will not be
perfectly predictable, nor will the number of such unit impulses be 
always the same in a given time interval. The law governing the 
distribution of these values is derivable from the stochastic model 
as 


$$p(N, t) = \binom{v_0}{N} \left(1 - e^{-kt}\right)^{N} e^{-(v_0 - N)kt}, \tag{35}$$


giving the probability of $N$ such transformations in time $t$.
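
The contrast between the fixed deterministic pattern and the
fluctuating stochastic one can be illustrated by simulation. The sketch
below is not the author's derivation. It assumes that each molecule
decomposes independently with an exponentially distributed lifetime of
rate k, so that the number of transformations by time t is binomial
with success probability 1 - exp(-kt); this is the standard result for
independent first-order decay and is taken here as the content of (35).
All names and numerical values are hypothetical.

```python
import math
import random


def binomial_law(n_events, v0, k, t):
    """Probability of exactly n_events transformations by time t."""
    p = 1.0 - math.exp(-k * t)  # chance that one molecule has reacted by t
    return math.comb(v0, n_events) * p**n_events * (1.0 - p) ** (v0 - n_events)


def simulated_frequencies(v0, k, t, runs, seed=0):
    """Relative frequency of each count N over repeated simulated runs."""
    rng = random.Random(seed)
    counts = [0] * (v0 + 1)
    for _ in range(runs):
        # each molecule gets an independent exponential lifetime with rate k
        reacted = sum(1 for _ in range(v0) if rng.expovariate(k) <= t)
        counts[reacted] += 1
    return [c / runs for c in counts]


# Hypothetical illustration values, not taken from the paper.
v0, k, t = 12, 1.0, 0.5
law = [binomial_law(n, v0, k, t) for n in range(v0 + 1)]
freq = simulated_frequencies(v0, k, t, runs=5000)
```

Comparing `law` with `freq` shows the simulated counts scattering about
the binomial probabilities; no single run reproduces the regular
spacing of the deterministic diagram.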

In a more complicated case, say case 4, the complete molecular 
set-theoretic representation is obtained by the Cartesian adjunction
to the domain and range spaces of time intervals $I_1$ and $I_2$,
corresponding to the first and second steps of the transformation, 
respectively: 


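$$T_1:\; (A_1)^{a_1} \otimes \cdots \otimes (A_m)^{a_m} \otimes I_1 \rightarrow (B_1)^{b_1} \otimes \cdots \otimes (B_n)^{b_n} \otimes I_1 \tag{36}$$

$$T_2:\; (B_1)^{b_1} \otimes \cdots \otimes (B_n)^{b_n} \otimes I_2 \rightarrow (C_1)^{c_1} \otimes \cdots \otimes (C_p)^{c_p} \otimes I_2 \tag{37}$$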


Thus, $(\{A_1\}_{i_1}, \{A_2\}_{i_2}, \ldots, \{A_m\}_{i_m})$ would be the $i$th multiplet of
equivalence classes, ordered according to some arbitrary scheme.
Using the representation (36), the transformation of this multiplet
at time $t_j$ would be recorded as:

$$(\{A_1\}_{i_1}, \{A_2\}_{i_2}, \ldots, \{A_m\}_{i_m};\; t_j). \tag{38}$$


As in the previous case, the time scale may be telescoped into
the ordering of the reactant species by referring to (38) as the $j$th
multiplet and writing simply


is 


pare =~ 


i.) (39) 
The analytic expressions for the number of individual transforma- 
tions taking place in a given time in this general case, correspond- 
ing to (34) and (35) in the preceding case, would of course be 
complicated and have closed forms only in special cases (Barthol- 
omay 1957). If these expressions are determinable, the identifica- 
tion scheme given in the discussion of Figure 2 can be followed 
in this case as well. 

The property which is emphasized by the representation here is 
the dynamic aspect of the concept of molecular set transformation 
(in contrast to the meaning of ordinary set mapping). Thus, considering
the interval $I$, when $t = 0$, $T: (A, 0) \rightarrow (B, 0)$ is not defined;
at time $t$, where $0 < t < T$, $T: (A, t) \rightarrow (B, t)$ refers to the
particular chemical transformation occurring at time $t$. The actual
specification of which molecule or multiplet of molecules is associated
with time $t$ is something determined by chemical and probabilistic
parameters and forces and is not determinable in advance
of an actual experiment. But the molecular transformation $T$ may
ultimately induce an ordinary transformation in certain cases if
the action is completed in finite time ($T < \infty$). Molecular set theory
then may be said to ‘‘merge’’ with ordinary set theory at this final 
stage. This would occur in a small, finite time in completely ir- 
reversible reactions (ideally speaking). The relation between the 
mathematical mapping and the molecular set transformation in such 
a case is similar to the relation between a statistical sample value
$x$ and its parent random variable $X$. A priori $X$ has $x$ for a possible
value with an assigned probability. At the completion of the experiment
$x$ either has occurred or has not. In any event $X$ is then
assigned a definite value, which may be $x$; and thenceforth an
ordinary numerical value replaces the random variable. However,
in reversible reactions there is no such merging, for an exchange
of material goes on indefinitely even at so-called ‘‘equilibrium’’;
from a mathematical point of view the situation is indeterminate 
and points to a basic conceptual difference and independence be- 
tween ordinary and molecular set theory. 

4. Discussion. It has been pointed out that expressions such as 
(2), (6), (9), (15), (20), (28) and (31) have the obvious algebraic 
connotations if the chemical formulas, i.e., the A’s, the B’s, the
C’s, etc., represent the weights or the atomic constitutions of the 
corresponding species. But this algebraic representation which is 
consistent with the purely stoichiometric aspects of a reaction is 
not consistent with the mechanistic aspects. If the A’s, B’s and 
C’s are identified not with measurable algebraic properties such 
as weight, but with the corresponding sets of molecules, (extend- 
ing the notion of abstract sets to the case of these material sets), 
which are referred to as ‘‘molecular sets,’’ then a more appropri- 
ate mathematical setting for the representation of mechanisms be- 
comes clear. The arrow, already in vogue in such cases, is placed 
in the mathematical context of a molecular set-theoretic mapping, 
and the molecular associations defined by particular reaction types 
are expressed by incorporating into molecular set theory, notions 
borrowed from ordinary set theory such as the intersection, union, 
and Cartesian product. The ‘‘algebraic’’ expressions (2)-(31) referred
to in the preceding paragraph are then rewritten as molecular
set-theoretic transformations as in the corresponding expressions
(4), (7), (9), (13), (16), (21), (25), (26), (29), and (32). Notationwise,
this representation can be extended to formalize
pertinent characteristics of the reaction medium by the adjunction
of further Cartesian components to the domain and range spaces.
For example, in section 3 in discussing the kinetic aspects of the
mechanism, the time domain $I$ was adjoined. More generally, one
can adjoin further Cartesian components separated by the symbol
$\otimes$, indicating the recognition of numerical parametric domains in
control of particular reactions, such as $\theta$ for temperature, $P$ for
pressure, pH for indications of the conditions of acidity, etc. In
cases where exact controls on these parameters are exercised, the
comma can be used to separate the various conditions, thinking of
them as fixed co-ordinates.

Thus the present investigation explicates the fact that whereas 
chemical stoichiometry is expressible mathematically in the lan- 
guage of algebra, chemical mechanism is more properly expressed 
in molecular-set-theoretic form. 

The mathematical model proposed here is also consistent with 
the stochastic treatment of reaction rates proposed by the author 
(1957, 1958) which centers on the probabilistic aspects of reac- 
tions in terms of individual events. Both are parts of a program to 
re-examine the current mathematical basis of chemical reaction 
kinetics from the more discrete point of view of the actual systems 
of molecules and chemical events involved. It emphasizes those 
features of kinetics which lie between the more microscopic quan- 
tum theoretic aspects of molecular transformation and the macro- 
scopic approach of the classical, deterministic theory of reac- 
tion rates. A mathematical rationale which forms a bridge be- 
tween these two areas should provide a broader mathematical 
base for dealing with both theoretical and empirical questions in 
kinetics. It has already been shown that from the stochastic 
model one can deduce a new statistical formula for first order rate 
constants (Bartholomay 1959) which takes into account uncon- 
trollable random fluctuations and which is simple to apply directly 
to observed concentration-time data. 

In the present investigation the discovery of a set-theoretic con- 
text specific for reaction mechanisms suggests points of entrance 
for mathematical reasoning borrowed from ordinary set theory into 
the delineation and prediction of different kinds of mechanistic behavior.
For example, corresponding to an ordinary set mapping
$T: A \rightarrow B$ such that $T(a)$ is a unique element $b \in B$ for a given
element $a$ of $A$, there exist ‘‘inverse’’ mappings $T': B \rightarrow A$ such
that $T'(T(a)) = a' \in A$. In the special case where $T'(T(a)) = a$
and $T(T'(b)) = b$, for each $a \in A$ and each $b \in B$, $T'$ may be referred
to as a ‘‘strict inverse’’ $T^{-1}$.
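
The distinction between an inverse mapping and a strict inverse can be
made concrete for finite sets. The sketch below represents a mapping as
a Python dictionary; the labels "a1", "b1", and so on, and the function
names, are hypothetical and serve only to illustrate the two conditions
stated above.

```python
def is_inverse(T, T_prime):
    """T_prime sends every image T(a) back to some element of the domain of T."""
    domain = set(T)
    return all(T_prime.get(T[a]) in domain for a in T)


def is_strict_inverse(T, T_prime):
    """T_prime(T(a)) == a for every a, and T(T_prime(b)) == b for every b."""
    back = all(T_prime.get(T[a]) == a for a in T)
    forth = all(T.get(T_prime[b]) == b for b in T_prime)
    return back and forth


T = {"a1": "b1", "a2": "b2"}
strict = {"b1": "a1", "b2": "a2"}
scrambled = {"b1": "a2", "b2": "a1"}  # returns to A, but to a different element
```

Here `scrambled` is an inverse mapping in the weaker sense, since it
carries each element of B back into A, but only `strict` recovers the
element that was originally transformed.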


It is interesting that if on the basis of this mathematical ra- 
tionale one looks for different kinds of inverses in terms of the 
chemical molecular set transformations, they are indeed present. 
Attention was focused on this very problem in the discussion of 
cases 5 and 6 in the text. The distinction arises as a consequence 
of differences in order of molecularity of elementary reaction 
mechanisms. The unimolecular reaction, in which A and B represent
single chemical species, furnishes examples of chemical reactions
which can have an inverse in the strictest sense just mentioned.
This is due to the fact that there exist in nature molecules,
say of type A, which can, with no changes in composition
but simply by structural transformations, exist as molecules of
type B: for each molecule $a \in A$ there exists an ‘‘isomer’’ or isomeric
form $b \in B$. Thus, given the isomeric unimolecular reaction
$T: A \rightarrow B$, the inverse transformation $B \rightarrow A$ will be of the form
$T^{-1}: B \rightarrow A$ with the composition $(T^{-1} \circ T)$ or $(T \circ T^{-1})$ leading to
recovery of the original molecule (assuming, of course, that the
transformations are accomplished without intermolecular atomic
exchanges).

The significance of such reactions in biological systems, say 
as steps in a complex system of reactions, is clear from the sim- 
plicity of the transformations involved and from the dual structural, 
biological specificity which could be allotted to essentially the 
same molecule. It has in fact been assumed in some developments 
of enzyme kinetics that in certain reactions the enzyme-substrate 
complex undergoes such a reaction, resulting in the establishment
of an equilibrium between the two different forms of intermediate 
complex preparatory to reaction. Thus reversible unimolecular re- 
actions may be thought of in terms of a biochemical systems theory 
in much the same sense as electrical diodes in a complex electri- 
cal network or computer mechanism. 

In the case of the more complicated multimolecular reaction 
mechanisms, as has been pointed out in the discussion of case 6, 
from collision-theoretic and probabilistic considerations it seems 
unlikely that a given multimolecular transformation could be reversed
in the strictest sense just explained. Referring to the example
of case 6: $T: A_1 \otimes A_2 \rightarrow B_1 \otimes B_2 \otimes B_3$, the mathematical
context of this mapping still implies the existence of an inverse
mapping $T': B_1 \otimes B_2 \otimes B_3 \rightarrow A_1 \otimes A_2$. But in this reversed reaction
one cannot state with certainty that the original pair $(a_1, a_2)$
will be reconstituted by collisions in the B-space of the molecules
of the triplet $(b_1, b_2, b_3)$ produced by the original $(a_1, a_2)$
molecules of the A-space. In other words $T'(T(a_1, a_2)) = (a_1', a_2')$
where $a_1'$ and $a_2'$ may be ‘‘new’’ or ‘‘partially new’’ members of the
A-space, and only in exceptional circumstances will $a_1' = a_1$ and
$a_2' = a_2$. There is only a probability that the original molecules
are recoverable by such a pair of successive transformations. One 
is therefore faced with the possibility that once such a reaction is
initiated, the A and B spaces will be constantly changing their
molecular constituents and replacing the original molecules of a
given species by others which are similar to (but not identical 
with) the original membership, subject to the limitations of the 
purely combinatorial problems involved. In theory, then, the multi- 
molecular reversible mechanism would be mathematically dis- 
tinguishable from the isomeric unimolecular case. This theoretical 
prediction could actually be realized, for example in biosynthetic 
incorporative experiments based on the principle of radioactive 
isotope tagging. 

Questions of a very similar kind surround the molecular-set- 
theoretic concept of exact molecular sequences of transformations. 
Specifically, the question arises as to whether there are exact se- 
quences of naturally occurring chemical reactions. Again this 
question resolves itself in terms of the molecular order of each 
step in such a chain of reactions. In the unimolecular case, which 
occurs for example in radioactive decompositions, there is the 
opposite of exactness, the non-exact sequence: the range set of
each reaction transformation determines the entire domain of the 
succeeding transformation. In the multimolecular case ‘‘steric’’ 
factors make it clear that as a result of the mixing of the individual 
multiplets produced, a condition which approaches exactness must 
come about: an effective transformation multiplet is formed to 
which a transformation $T_1$ is applied, which has for its ‘‘value’’ a
multiplet of molecules representing the different chemical species
in the product domain of $T_1$. It cannot be said with certainty (say,
probability one) that the newly formed multiplet will itself be an
effective one; i.e., that the member molecules will collide effectively
and serve as an element of the domain of the next transformation
$T_2$. Some of its molecular substituents will collide with
the complementary substituents of other such newly formed multiplets,
so that in the next transformation $T_2$ the identities of the
original reactant molecules will not be preserved. Obviously in a 
long reaction sequence of this type the probability of preservation 
of these original identities must grow monotonically smaller. 
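
The monotonic decline can be illustrated with a toy calculation. The
text gives no formula for it; the sketch below simply assumes, for
illustration, that each multimolecular step preserves the original
identities with the same probability p_step, independently of the other
steps. Both the independence assumption and the value 0.9 are
hypothetical.

```python
def preservation_probability(p_step, n_steps):
    """Chance that the original identities survive n_steps successive steps,
    assuming each step preserves them independently with probability p_step."""
    return p_step ** n_steps


# Hypothetical per-step preservation probability of 0.9 over ten steps.
chain = [preservation_probability(0.9, n) for n in range(1, 11)]
```

Under these assumptions the values in `chain` fall geometrically with
the length of the reaction sequence.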

It is possible that in a living organism conditions can be so 
regulated that even in multimolecular reaction chains non-exactness,
as in the unimolecular case, becomes the most probable sequential
mechanism. Very fast reaction rates, biological molecules
of very high specificity, etc., could militate to bring about
such a highly ordered system. In terms of the biochemical aspects
of heredity and development, chains of biochemical reactions of 
the non-exact type would promote the direct transference and 
preservation of hereditary characteristics carried by the original 
molecules in the chain. The idealized assumption of the indis- 
tinguishability of molecules of a given species would be an argu- 
ment against such a rationale. However, of particular pertinence 
here is the discovery made by Dr. Wacker and Dr. Vallee (1959) in 
this laboratory, that RNA from various sources contains significant 
amounts of firmly bound metals: magnesium, calcium, strontium, 
barium, aluminum, chromium, manganese, iron, nickel, copper and 
zinc. This finding correlates with previous findings that ribo- 
nucleoprotein contained large amounts of the same metals. These 
investigators state: ‘‘These elements have not heretofore been 
found to be aggregated to this extent in any other biological ma- 
terial; this supports the postulated specificity of interaction be- 
tween metals and RNA. Since ribonucleic acids are known to be 
inhomogeneous, further purification may result in the isolation of 
ribonucleic acids which contain only one specific metal.’’ While 
the role of metals in these complex molecules has not been eluci- 
dated as yet, it is possible that their association with metals im- 
poses a distinguishability on the molecules which is preserved in 
the chain of reactions in which they are involved. Such findings 
are already being examined in relation to the transmission of 
genetic information and protein synthesis. Thus, it is possible 
that by various chemical and physical agents a distinguishability 
may be induced on seemingly homogeneous populations of mole- 
cules of a given species, lending functional significance (in the 
biological sense) to some of the finer distinctions which are de- 
ducible from the molecular-set-theoretic model of reaction mecha- 
nism. 

5. Conclusion. In addition to suggesting a new representation 
for chemical mechanism the present investigation extends and 
makes contact with the mathematical basis for discussing chemical 
reaction kinetics in terms of discrete and random events, begun in 
earlier work (Bartholomay 1957-59). For example, a molecular set 
transformation $T$ is not an ordinary set mapping, but is determined
by a combination of chemical and random effects operating in a 
time domain. The reduction to a purely mathematical mapping is 
thus not a priori determinable; it is a part of the stochastic as- 
pects of chemical reaction kinetics, as emphasized in section 3. 

In a larger context, the origin of the notion of molecular sets
and transformations from the study of natural (chemical) events 
adds further interest to the work of Rashevsky (1954-59) and 
Rosen (1958-59) on the use of topological representation in the 
analysis of biological systems. Topology, which deals in the re- 
lational aspects within and between geometrical configurations, 
may be implicit in many other areas beyond those already covered 
in this work and in the work of the other authors cited.

The name ‘‘molecular set theory’’ has been coined in the present 
study to emphasize the fact that it is a mathematical description 
of a natural process, as opposed to a mathematical abstraction. 
(Most mathematical models fit into the latter category.) The mo- 
lecular set model therefore suggests, by connotation, some new 
mathematical problems in ordinary set theory, in the language of 
which it is expressed. For example, the molecular set notion of 
‘fexact sequence’’ of transformations describes a restricted type 
of reaction chain, was suggested by the parallel structure of se- 
quences of modules or homology groups with their associated 
homomorphisms, and suggests the extension of the exactness no- 
tion to more general, abstract sets and their mappings in ordinary 
set theory. Similarly, the molecular set notion of ‘‘equivalence 
class’’ describes the structure of an effective or reactive multi- 
plet, was suggested by the parallel algebraic concept, and sug- 
gests the extension of the equivalence class notion into ordinary 
set theory. In both examples the extensions could be in terms of 
more general sets than those which parallel the finite sets in 
molecular set theory. 

In summary then, molecular set theory leads to 1) a well defined, 
highly specific language for representing the molecular associ- 
ations involved in chemical reaction mechanisms, 2) a mathemati- 
cal rationale for analyzing and predicting the finer aspects of such 
intermolecular associations, and 3) a new variety of mathematical 
problems in the domain of ordinary set theory. 


The author acknowledges his thanks to Dr. Rashevsky whose
pertinent comments inspired the expansion of the discussion into
its present form. This work was supported by the Howard Hughes
Medical Institute.


Bartholomay, A. F. 1957. ‘‘A Stochastic Approach to Chemical Reaction
Kinetics.’’ S.D. Thesis, School of Public Health, Harvard University.

______. 1958. ‘‘Stochastic Models for Chemical Reactions: I. Theory of
the Unimolecular Reaction Process.’’ Bull. Math. Biophysics, 20,
175-190.

______. 1959. ‘‘Stochastic Models for Chemical Reactions: II. The
Unimolecular Rate Constant.’’ Bull. Math. Biophysics, 21, 363-373.

Brown, H. C., Fletcher, R. S. 1949. ‘‘Chemical Effects of Steric Strains.
I. The Effect of Structure Upon the Hydrolysis of Tertiary Aliphatic
Chlorides.’’ J. Am. Chem. Soc. 71, 1845-1854.

Cartan, H., Eilenberg, S. 1956. Homological Algebra. Princeton, N. J.:
Princeton Univ. Press.

Frost, A. A., Pearson, R. G. 1953. Kinetics and Mechanism. New York:
John Wiley and Sons, Inc.

Hinshelwood, C. N. 1927. ‘‘Theory of Unimolecular Reactions.’’ Proc.
Roy. Soc. A 113, 230-233.

Hurewicz, W. 1941. ‘‘On Duality Theorems’’ (Abst.). Bull. Amer. Math.
Soc. 47, 562.

Kelley, J. L., Pitcher, E. 1947. ‘‘Homology Theory.’’ Ann. Mathematics
48, 682-709.

Lindemann, F. A. 1922. ‘‘Discussion on the Radiation Theory of Chemical
Action.’’ Trans. Faraday Soc. 17, 598-599.

Rashevsky, N. 1954. ‘‘Topology and Life: In Search of General Mathematical
Principles in Biology and Sociology.’’ Bull. Math. Biophys.
16, 317-348.

______. 1955 a. ‘‘Some Theorems in Topology and a Possible Biological
Implication.’’ Ibid., 17, 111-126.

______. 1955 b. ‘‘Some Remarks on Topological Biology.’’ Ibid., 17,
207-218.

______. 1955 c. ‘‘Life, Information Theory, and Topology.’’ Ibid., 17,
229-235.

______. 1956. ‘‘The Geometrization of Biology.’’ Ibid., 18, 31-56.

______. 1957. ‘‘A Note on the Geometrization of Biology.’’ Ibid., 19,
201-4.

______. 1958. ‘‘A Contribution to the Search of General Mathematical
Principles in Biology.’’ Ibid., 20, 71-93.

______. 1959. ‘‘A Set Theoretical Approach to Biology.’’ Ibid., 21,
101-106.

Rosen, R. 1958 a. ‘‘The Representation of Biological Systems from the
Standpoint of the Theory of Categories.’’ Bull. Math. Biophysics, 20,
317-341.

______. 1958 b. ‘‘A Relational Theory of Biological Systems.’’ Ibid., 20,
245-260.

______. 1959. ‘‘A Relational Theory of Biological Systems.’’ Ibid., 21,
109-128.

Schroedinger, E. 1945. What is Life? The Physical Aspects of The
Living Cell. New York: The Macmillan Co.

Wacker, W. E. C. and Vallee, B. L. 1959. ‘‘Nucleic Acids and Metals
I.’’ J. Biological Chemistry, 234, 3257-3262.


RECEIVED 9-3-59 




BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 22, 1960 


SOME MATHEMATICAL ASPECTS OF CHEMOTHERAPY-—II: 
THE DISTRIBUTION OF A DRUG IN THE BODY 


JOHN A. JACQUEZ, RICHARD BELLMAN AND ROBERT KALABA 
DIVISION OF EXPERIMENTAL CHEMOTHERAPY, SLOAN-KETTERING 
INSTITUTE FOR CANCER RESEARCH AND SLOAN-KETTERING 
DIVISION, CORNELL UNIVERSITY MEDICAL COLLEGE, 

NEW YORK CITY; AND THE RAND CORPORATION, 

SANTA MONICA, CALIFORNIA 


In a previous paper (Bellman, Jacquez, and Kalaba, Bull. Math. Bio- 
physics, 22: 181-198, 1960) a model of the processes occurring in the ex- 
change of a drug between capillary plasma, extracellular space and intra- 
cellular space was developed. This included the possibility of a reac- 
tion between the drug and a component of the intracellular space. The 
equations developed thus describe the events within a capillary bed. In 
the present paper, a simplified model of the body is set up. Each organ 
is treated as a single capillary bed and is linked to other organs via the 
circulation, in the parallel and/or series arrangements found in the body. 
Mixing in the circulation is included at the simplest possible level. The 
concentration of drug entering any one capillary bed is determined by 
the concentrations leaving all other capillary beds, the time lags, and 
mixing involved in the circulation. The equations describing these proc- 
esses in conjunction with the equations of the processes occurring within 
each capillary bed lead to a large set of differential-difference equations. 


Introduction. The mathematical treatment of a realistic model of 
the distribution of a drug in the body and of the effect of this distribution
on the drug action is a significant problem of current interest
in physiology, pharmacology and the theory of chemotherapy.
An early attempt to describe the distribution of nitrogen was that 
of Von Schrotter (1906), while Teorell (1937) gave the first exten- 
sive mathematical treatment of drug distribution which took into 
account diffusion between tissues and blood, elimination by the 
kidneys, and detoxification. However, the role of the circulation 
of the blood was virtually ignored by treating the total supply of 
blood as one uniform pool. Smith and Morales (1944) and Morales 
and Smith (1945, 1948), on the other hand, have treated the problem
of the distribution of an inert gas and have pointed out the impor- 
tance of the arrangement of the tissues with respect to the cir- 
culation. 

In a previous paper (Bellman, Jacquez, and Kalaba; 1960), we 
set up a model of the distribution of a compound within the plasma 
flowing through a capillary bed, and within the extracellular space 
and intracellular space. Included in this model is the possibility 
of the reaction of the drug with a component of the intracellular 
space. 

In the present paper, we wish to extend the model so as to in- 
clude the effects of the major tissues of the body and of the circu- 
lation. Our aim is to show the relationship of this general problem 
to feedback problems, input-output analysis, and transportation 
problems which arise in economic control processes. 

Preliminary Discussion. The model we employ of the distribu- 
tion of a compound between the plasma flowing through a capillary 
bed, the extracellular space, and the intracellular space, leads by way 
of standard arguments to equations (1) to (4), (Bellman, Jacquez 
and Kalaba; 1959). It is assumed that the drug $D$ reacts with a
compound $E$ to form $ED$ in the intracellular space.


$$R_p \frac{du}{dt} = C(u_i - u) - k_1 A_1 \left[ \frac{u_i + u}{2} - v \right], \tag{1}$$

$$R_e \frac{dv}{dt} = k_1 A_1 \left[ \frac{u_i + u}{2} - v \right] - k_2 A_2 (v - w) - K A_2 v, \tag{2}$$


dw 
dz 


The various terms in equations (1) to (4) are defined below. 


$u_i(t)$ = concentration of drug in the plasma entering the          (5)
         capillary bed.

$u(t)$ = concentration of drug in the plasma leaving the             (6)
         capillary bed.

$v(t)$ = spatially uniform concentration of drug at time $t$         (7)
         in the extracellular space. Probably this should
         be viewed as a spatial average of the drug concentration
         in the extracellular space.

$w(t)$ = uniform intracellular concentration of drug. An             (8)
         assumption of immediate mixing has been made.

$z(t)$ = concentration of compound $ED$ in the intracellular         (9)
         space.

$E_0$ = concentration of substance $E$ at $t = 0$.                   (10)

$C$ = volume rate of plasma flow in the capillary bed.               (11)

$R_p$ = plasma volume in the capillary bed.                          (12)

$R_e$ = volume of extracellular space.                               (13)

$R_c$ = volume of intracellular space.                               (14)

$A_1$ = area of contact for diffusion between extracellular          (15)
        space and the plasma in the capillary bed.

$A_2$ = area of contact between extracellular and intracellular      (16)
        space.

$k_1$ = permeability constant for diffusion between plasma           (17)
        and extracellular space.

$k_2$ = permeability constant for diffusion between extracellular    (18)
        and intracellular space.

$K$ = transport constant. We assume that besides diffusion           (19)
      there may be an active transport concentrating
      the compound intracellularly and that this
      term is given by $K A_2 v$.


$k_3, k_4$ = the rate constants for the reaction                     (20)

$$E + D \underset{k_4}{\overset{k_3}{\rightleftharpoons}} ED.$$

It is pertinent at this point to consider the size of the capillary 
bed to which these equations apply. The equations were derived 
from a macroscopic viewpoint and apply, therefore, to average con- 
centrations; they do not describe the details of diffusion around 
one capillary. On the other hand, the blood flow to large regions 
or organs is not uniform through all portions. Large regions should 
be viewed as a group of capillary beds (to which the equations are 
applicable) which are primarily in a parallel arrangement with a
distribution of blood flows, or transit times. For the time being we 
eschew this statistical approach to concentrate on the large scale 
features of blood flow through the major organs of the body. We 
are assuming that an organ can be viewed as one large capillary 
bed with one uniform mean blood flow. One further reservation 
concerning the applicability of equations (1)-(4) should be made. 
In deriving equations (1)-(4), an assumption of immediate mixing 
in the extracellular space was made. The relatively uniform dis- 
tribution of capillaries with respect to position and direction of 
flow tends to smooth out concentration gradients arising from dif- 
fusion between capillaries and the extracellular fluid and makes 
the preceding a fair approximation for most tissues. 

This may not be true for the central nervous system. The latter 
has a large extracellular volume which has a relatively restricted 
area of contact with capillaries compared to other regions. How- 
ever, in lieu of detailed knowledge as to proper assumptions to 
make, we use the foregoing as an approximation. 

For the present purposes, therefore, we consider an organ as a 
box with a flow of plasma containing the compound at a concentration
$u_i$ entering it, and leaving it at a concentration $u$.


FIGURE 1. Input-output model of an organ. 


Equations (1)-(4) then describe processes going on inside the 
box. The linkage between various organs will occur through the 
boundary conditions imposed upon the $u_i$ by the mixing and recirculation
of the blood. These conditions will be dependent on
the arrangement of the various organs in relation to each other and 
to the circulation (parallel or series arrangements), and on the 
nature of the circulatory system. 

Anatomical and Physiological Considerations. The majority of 
the organs of the body are in parallel arrangement with respect to 
blood supply. However, the blood supply to the gastrointestinal 
tract, spleen and liver has a special arrangement which requires 
further discussion, as does the model for the kidney and the over- 
all circulation. 

A. Gastrointestinal Tract, Spleen, Liver. The circulation to 
these organs forms a parallel-series arrangement called the portal
system. As shown in Figure 2, the arterial supply comes from three 
major sources. The upper one, the coeliac artery, gives rise to 
the hepatic artery, and the arterial supply to the spleen, stomach, 
pancreas and upper duodenum; the capillary beds of these last four 
have been lumped in the model. The superior mesenteric artery 
supplies the major part of the small intestine while the inferior 
mesenteric artery supplies the major part of the large intestine. 
The venous drainage from these three areas combines to form the 
portal vein which enters the liver where the portal venous blood is 
mixed with the arterial blood in the hepatic artery at the point 
where it enters the liver sinusoids (a modified capillary bed). 
Thus the liver is perfused by a mixture of arterial blood and of 
venous blood which has already passed through capillary beds in 
the spleen and gastrointestinal tract. This parallel-series ar- 
rangement is itself in parallel with other organs. 
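
The mixing of portal venous blood with hepatic arterial blood can be
sketched as a flow-weighted average. This is only an illustration of
the mass balance under the assumption of instantaneous, complete mixing
and with the circulatory time lags ignored; it is not the authors'
formulation. The function name and all flows and concentrations are
hypothetical.

```python
def mixed_concentration(streams):
    """Flow-weighted mean concentration of several merging streams.

    streams: list of (flow, concentration) pairs in any consistent units.
    """
    total_flow = sum(flow for flow, _ in streams)
    if total_flow <= 0:
        raise ValueError("total flow must be positive")
    return sum(flow * conc for flow, conc in streams) / total_flow


# Hypothetical venous drainage from the three arterial territories:
# (flow, outflow concentration) for the coeliac bed, the small intestine
# and the large intestine.
portal = [(400.0, 0.2), (500.0, 0.5), (200.0, 0.8)]
portal_flow = sum(flow for flow, _ in portal)
portal_conc = mixed_concentration(portal)

# Hypothetical hepatic artery stream mixed with the portal vein stream.
liver_input = mixed_concentration([(300.0, 1.0), (portal_flow, portal_conc)])
```

The value `liver_input` plays the role of the entering concentration
for the liver, and the same helper could serve wherever several venous
streams merge.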


[Diagram: the coeliac artery, the superior mesenteric artery (small
intestine) and the inferior mesenteric artery (large intestine).]

FIGURE 2. The circulation to the gastrointestinal tract, spleen and
liver.


B. The Kidneys. The first step in the formation of urine in the 
kidney is the formation of an ultrafiltrate of plasma, the glomerular 
filtrate. Because of their ability to concentrate materials in or re- 
absorb from the glomerular filtrate, the kidneys represent a special 
problem. The mechanisms of kidney function are still not clear 
enough to allow us to make a detailed model. However, we can in- 
corporate some of the major phenomenological features of renal 




function in a model which assumes relatively constant blood flows 
so as to make the glomerular filtration rate constant. The model 
adopted can then be the same as that for other organs with the ad- 
dition of a sink for loss of material (urinary excretion) and the 
major features of urinary excretion can be incorporated in the loss 
function describing the action of the sink. The model is shown in 
Figure 3. 




FIGURE 3. The model for the kidney. 


The proper loss function to use will depend on whether the drug 
is reabsorbed, actively secreted into the urine or neither. For a 
compound which is neither reabsorbed nor secreted, the rate of 
loss of drug will be given by equation (21). 


$$\frac{dD}{dt} = -A u_i. \tag{21}$$


For a compound like glucose which is reabsorbed and has a thresh- 
old, the rate of loss will be given by a function of the type shown 
graphically in Figure 4. This may be approximated by equation (22):

$$\frac{dD}{dt} = \begin{cases} 0, & u_i < u^*, \\ -A(u_i - u^*), & u_i \geq u^*. \end{cases} \tag{22}$$


For a drug that is actively secreted, the rate of loss of drug by
urinary excretion will be given by equation (23) which describes
the curve in Figure 5.


$$\frac{dD}{dt} = \text{[right-hand side illegible in source]} \tag{23}$$


This may be approximated by equation (24). 




FIGURE 4. Excretion rate for a compound which is reabsorbed from 
the glomerular filtrate and has a threshold. 


$$\frac{dD}{dt} = \begin{cases} -A_1 u_i, & u_i < u^*, \\ -A_1 u^* - A_2(u_i - u^*), & u_i \geq u^*, \end{cases} \qquad A_1 > A_2. \tag{24}$$


Equation (1) for the kidney must be changed by the addition of 
equation (22), (23), or (24). 
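
The piecewise loss functions can be sketched computationally as follows. This is only an illustrative sketch and not part of the original formulation: the function names are hypothetical, and for the actively secreted compound we assume a two-segment linear form (slope A_1 below u*, slope A_2 above it, with A_1 > A_2) chosen so that the curve is continuous at u*.

```python
def loss_filtered(u, A):
    """Equation (21): compound neither reabsorbed nor secreted; dD/dt = -A*u."""
    return -A * u


def loss_threshold(u, A, u_star):
    """Equation (22): reabsorbed compound with threshold u_star."""
    if u < u_star:
        return 0.0  # below threshold nothing is excreted
    return -A * (u - u_star)


def loss_secreted(u, A1, A2, u_star):
    """Assumed two-segment form of equation (24), with A1 > A2."""
    if u < u_star:
        return -A1 * u  # steep segment: filtration plus secretion
    # shallower segment once secretion is saturated; continuous at u_star
    return -A1 * u_star - A2 * (u - u_star)


if __name__ == "__main__":
    # Hypothetical parameter values, for illustration only.
    for u in (0.5, 1.0, 2.0, 4.0):
        print(u, loss_filtered(u, 0.5), loss_threshold(u, 0.5, 2.0),
              loss_secreted(u, 2.0, 0.5, 2.0))
```

Whichever function applies would be added as a sink term to the capillary equation for the kidney.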

Provided there is no diuresis and we can assume that the body 
is in water balance, we can neglect the change in volume of blood 
passing through the kidney due to excretion of urine. 


FIGURE 5. Excretion rate for a compound which is actively secreted. 




C. The Circulation. The major problem in the linkage of these 
systems is the nature of the blood flow and mixing in the large 
vessels and mixing in the heart. 

1. Flow and Mixing in the Large Vessels. Flow in the large 
vessels is probably laminar with local areas of turbulence at some 
branching points and at the root of the aorta. To try to incorporate 
the equations of flow of a fluid such as blood in this model would 
needlessly complicate the picture since we need primarily a de- 
scription of the mixing effects in the circulation. We assume 
simple volume (displacement) flow and further that there is com- 
plete mixing perpendicular to the direction of flow. Any mixing in 
the direction of flow would tend to smooth out concentration gradi- 
ents along a vessel: such an effect can be partly incorporated in 
the parameters describing the mixing action of the heart. 

2. The Action of the Heart. The right and left sides of the heart
each act like a pump with a residual volume V_R and an ejection volume
V_E. If C is the minute output and R the heart rate, the period is
T = 1/R and V_E R = C. In each cycle there is an ejection phase
(systole) and a filling phase (diastole). With each systole the
volume V_E is pumped into the aorta, which acts like an elastic
reservoir to keep blood flowing into its branches during diastole.
Consider now a simple case in which we assume volume displace-
ment flow with an elastic reservoir effect to maintain flow through-
out a cycle of heart action. Let y_0 be the concentration of a com-
pound in the residual blood in the pump and let its concentration
be zero in the blood entering the heart. Then the concentration
in the first ejection volume will be y_0 V_R/(V_E + V_R), in the second
y_0 [V_R/(V_E + V_R)]^2, and in general the concentration in the nth
ejection volume will be given by equation (25):

$$y_n = y_0 \left(\frac{V_R}{V_E + V_R}\right)^{n}, \quad (n-1)T \leq t < nT. \tag{25}$$


This description yields the graph shown in Fig. 6. In reality,
mixing in the aorta smooths out these discontinuities so it is a
fair assumption that y(t) will be given by equation (26) for all t:

$$y(t) = y_0 \left(\frac{V_R}{V_E + V_R}\right)^{t/T} = y_0 \left(\frac{V_R}{V_E + V_R}\right)^{(C/V_E)\,t}. \tag{26}$$


FIGURE 6. Graph of y(t).


This may also be written as in equation (27):

$$y(t) = y_0 \exp\left[-\frac{C}{V_E}\, t\, \ln\left(\frac{V_E + V_R}{V_R}\right)\right] = y_0 \exp\left(-\frac{C}{V^*}\, t\right), \tag{27}$$

where V* = V_E / ln(1 + V_E/V_R).


This, however, is the equation of a mixing pot of volume V* with 
entering and leaving flow rates C, and with instantaneous mixing 
in the pot. Therefore, in the model we may represent the right and 
left sides of the heart by mixing pots of volume V*. 
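
The equivalence between the beat-by-beat dilution and the mixing pot can be checked numerically. The following is a minimal sketch under stated assumptions: the function names are hypothetical, and the numerical values (ejection volume 70 ml, residual volume 50 ml, 70 beats per minute) are illustrative only and are not taken from the text. At the end of each beat, t = nT, the stepwise value of equation (25) and the exponential of equation (27) should coincide.

```python
import math


def effective_volume(V_E, V_R):
    """Mixing-pot volume V* = V_E / ln(1 + V_E / V_R)."""
    return V_E / math.log(1.0 + V_E / V_R)


def stepwise_concentration(y0, V_E, V_R, n):
    """Equation (25): concentration in the nth ejection volume."""
    return y0 * (V_R / (V_E + V_R)) ** n


def mixing_pot_concentration(y0, C, V_star, t):
    """Equation (27): washout from a well-mixed pot of volume V*."""
    return y0 * math.exp(-C * t / V_star)


if __name__ == "__main__":
    # Illustrative values only (assumptions, not from the source).
    V_E, V_R, R = 70.0, 50.0, 70.0   # ml, ml, beats per minute
    C = V_E * R                      # minute output, ml/min
    T = 1.0 / R                      # period of one beat, min
    V_star = effective_volume(V_E, V_R)
    for n in range(1, 6):
        step = stepwise_concentration(1.0, V_E, V_R, n)
        smooth = mixing_pot_concentration(1.0, C, V_star, n * T)
        print(n, step, smooth)
```

Between beats the two descriptions differ, which is the discontinuity that mixing in the aorta is assumed to smooth out.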

The Complete Model. We may now proceed to construct the com- 
plete model, a schematic diagram of which is given in Fig. 7. The 
pulmonary circuit from the right side of the heart to the left is on 
top and the systemic circuit is below the heart. We follow the flow 
of blood starting from the points where it leaves the systemic 
capillary beds. The concentrations leaving the systemic capillary
beds will be u_1(t) for the liver, u_2(t) for the kidney, and u_3(t) to
u_n(t) for the n - 2 other systemic regions which are not special-
ized, as are those for the liver and kidney. The quantities T_{Rj}
represent the mean times for flow from the exit of capillary bed j to
the right side of the heart. Then the total volume flow rate into
the right heart is given by the following expression,

$$C = \sum_{j=1}^{n} C_j, \tag{28}$$

while the concentration of material entering the right side of the
heart will be u_R(t), as given by

$$u_R(t) = \frac{1}{C} \sum_{j=1}^{n} C_j\, u_j(t - T_{Rj}). \tag{29}$$
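
The flow-weighted mixing of the delayed venous streams can be sketched as below. This is an illustrative sketch only: the function name and the representation of each effluent concentration u_j as a Python callable of time are assumptions made for the example, as are the numerical values.

```python
def mixed_venous_concentration(t, flows, lags, effluents):
    """Flow-weighted mean of delayed effluent concentrations.

    flows     -- C_j, volume flow rate through capillary bed j
    lags      -- T_Rj, mean transit time from bed j to the right heart
    effluents -- callables u_j(t) giving the concentration leaving bed j
    """
    total_flow = sum(flows)  # total flow into the right heart
    weighted = sum(C_j * u_j(t - T_j)
                   for C_j, T_j, u_j in zip(flows, lags, effluents))
    return weighted / total_flow


if __name__ == "__main__":
    # Hypothetical two-bed example: one constant stream, one rising stream.
    flows = [1.0, 3.0]
    lags = [0.5, 1.0]
    effluents = [lambda t: 2.0, lambda t: t]
    print(mixed_venous_concentration(3.0, flows, lags, effluents))
```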


[Figure 7: schematic of the complete model. Column headings on the venous side: volume, effluent concentration, flow rate, time lag to right heart. Column headings on the arterial side: entering concentration, flow rate, time lag from left heart.]

FIGURE 7. A model of the body.


Since the right heart is representable as a mixing pot of volume V*,
the concentration leaving the right heart, u_{RH}(t), is given by the
following relation:

$$u_{RH}(t) = u_{RH}(0)\, e^{-(C/V^*)\,t} + \frac{C}{V^*} \int_0^t u_R(z)\, e^{-(C/V^*)(t - z)}\, dz. \tag{30}$$
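
The mixing-pot relation for the heart can be evaluated numerically for any entering concentration. The sketch below is a hedged illustration: it assumes the relation has the form of an initial value decaying at rate C/V* plus a convolution of the entering concentration with the same exponential kernel, and it approximates the integral by the trapezoidal rule. The function name, the step count and the numerical values are assumptions made for the example.

```python
import math


def mixing_pot_output(u0, C, V_star, u_in, t, steps=2000):
    """Concentration leaving a mixing pot of volume V_star at time t.

    u0   -- concentration in the pot at t = 0
    u_in -- callable giving the entering concentration as a function of time
    """
    k = C / V_star          # washout rate constant
    h = t / steps           # step of the trapezoidal rule
    total = 0.0
    for i in range(steps + 1):
        z = i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * u_in(z) * math.exp(-k * (t - z))
    return u0 * math.exp(-k * t) + k * h * total


if __name__ == "__main__":
    # Hypothetical values: C = 5 l/min, V* = 0.1 l, constant inflow of 2.0.
    print(mixing_pot_output(0.5, 5.0, 0.1, lambda z: 2.0, 0.1))
```

For a constant entering concentration the result can be compared with the closed form u0 e^{-kt} + u_in (1 - e^{-kt}), where k = C/V*.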

Then if T_{RP} is the time lag for flow from the right heart to the
capillary bed of the lungs, the concentration entering the lung
capillary bed will be u_{RH}(t - T_{RP}). Let u_{PL}(t) be the concentra-
tion leaving the capillary bed of the lungs. If T_{PL} is the time lag
to the left heart, the concentration entering the left heart will be
u_{PL}(t - T_{PL}). Again, since the left heart is a mixing pot of volume
V*, the concentration leaving the left heart, u_L(t), will be given by
formula (31):

$$u_L(t) = u_L(0)\, e^{-(C/V^*)\,t} + \frac{C}{V^*} \int_0^t u_{PL}(z - T_{PL})\, e^{-(C/V^*)(t - z)}\, dz. \tag{31}$$




For the arterial side of the systemic circuit, for j ≥ 2, the concen-
tration entering the capillary beds will be

$$u_{ij}(t) = u_L(t - T_{Lj}), \quad j \geq 2.$$
The circulation to the gastrointestinal tract, spleen and liver re-
quires a more detailed description. The concentration in the he-
patic artery at the point where it enters the liver sinusoids is
given by

$$u_{i1,1}(t) = u_L(t - T_{L1,1})$$

with time lag T_{L1,1} from the left side of the heart. Let u_{i1,2},
u_{i1,3} and u_{i1,4} be the concentrations entering the capillary beds of
a) stomach, spleen and pancreas; b) the small intestine, and
c) the large intestine, respectively. Let u_{1,2}, u_{1,3} and u_{1,4} be
the concentrations in the blood leaving these capillary beds and
let T_{1,2}, T_{1,3} and T_{1,4} be the time lags from these capillary beds
through the portal vein to the liver sinusoids. We assume that on
entering the liver sinusoids the portal blood will be mixed com-
pletely with that entering from the hepatic artery. Then the con-
centration in the blood entering the liver sinusoids will be u_{i1}(t),
where

$$u_{i1}(t) = \frac{C_{1,1}\, u_L(t - T_{L1,1}) + \sum_{k=2}^{4} C_{1,k}\, u_{1,k}(t - T_{1,k})}{C_1}. \tag{32}$$


The volume of the blood in the various parts of the circulation 
links the flow rates and time lags, giving equations of constraint 
on the system. Thus the volume in the systemic veins, i.e., from 
the exit from the systemic capillary beds to the right side of the 
heart, is 


$$V_{SV} = \sum_{j=1}^{n} C_j\, T_{Rj}, \tag{33}$$


and the volume from the right heart to the pulmonary capillary bed 
is 


$$V_{RP} = C\, T_{RP}. \tag{34}$$




The volume from the exit from the pulmonary capillaries to the left 
heart is 


$$V_{PL} = C\, T_{PL}. \tag{35}$$


The volume in the systemic arteries is 


$$V_{SA} = \sum_{k=1}^{4} C_{1,k}\, T_{L1,k} + \sum_{j=2}^{n} C_j\, T_{Lj}, \tag{36}$$


while the volume in the portal venous drainage is

$$V_{PV} = \sum_{k=2}^{4} C_{1,k}\, T_{1,k}. \tag{37}$$

We also have the relation $C_1 = \sum_{k=1}^{4} C_{1,k}$ for the flows into the liver.

Initial Conditions. The initial conditions will be determined by 
the mode of administration of the drug. Let us discuss the various 
possibilities. 

A. Injection into a Blood Vessel. Intravenous and intra-arterial 
administration may be considered together. An injection over a 
small period of time of a compound into a vein or artery can be 
treated as a rectangular concentration wave moving down the ves- 
sel. If the volume occupied by this injection is much less than 
the ejection volume of the heart, an intravenous injection could be 
treated as an instantaneous injection of the compound into the 
right heart. A constant intravenous injection rate is often used. 
Provided that the volume injected is small compared to the total 
blood volume and that the body is in water balance, this case may 
be treated like that of a source in the vessel which adds d/C_j to
the concentration of compound in the vessel. Here d is the rate of
drug administration and C_j is the volume flow rate in the vessel.
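
As a small worked illustration of the constant-infusion case, the increment in concentration is simply the infusion rate divided by the flow rate in the vessel. The function name and the numbers below are hypothetical and chosen only to show the units.

```python
def infusion_increment(d, flow):
    """Concentration added by a constant infusion at rate d into a vessel
    carrying the given volume flow rate."""
    if flow <= 0:
        raise ValueError("flow rate must be positive")
    return d / flow


if __name__ == "__main__":
    # Hypothetical: 10 mg/min infused into a vessel carrying 5000 ml/min.
    print(infusion_increment(10.0, 5000.0))  # mg per ml
```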

B. Intramuscular and Subcutaneous Injection. The result of the 
deposition of an amount of drug intramuscularly or subcutaneously 
is to place a source in the extracellular space of these tissues. 
This case may be treated by adding the equation of the source to 
equation (2) for the tissue involved. If we call the amount of drug 
in this source D, the rate at which drug will be delivered to the 
extracellular space of the tissue involved will be given by equa-
tion (38):

$$\frac{dD}{dt} = -b(t), \quad D > 0. \tag{38}$$
This term should be added to equation (2) for the tissue involved 
and equation (38) should be added to the system of equations 
(1)-(4). 

C. Oral Administration. If there is no active absorption mecha- 
nism, a drug administered orally probably can be treated as a 
source in the extracellular space of the small intestine with an 
equation similar to equation (38). If the drug enters an active ab- 
sorption mechanism in the intestine, the problem is more difficult. 
Since this is still a poorly understood mechanism, it perhaps can 
be handled like the kidney: treat the ingested drug as a source in
the extracellular space of the intestine with an absorption equa- 
tion that describes the major experimental features of absorption 
of the particular drug involved. 

Discussion. Once we have formulated the problem of tracing 
the path of a drug through the body, and determining its concen- 
tration as a function of time in any particular location, we observe 
the analogy between this problem and many of the problems arising 
in economic and industrial life. Many of the vital scheduling, 
transportation and production processes possess the same abstract 
format. All of these processes can be considered to be input- 
output processes. 

Similarly, once we examine the question of determining the ap- 
propriate drug to use, its rate of injection and its total dosage, in 
terms of the foregoing model, we see that we face a problem which 
is more difficult than, but completely analogous to, the by now 
classical feedback control problem of engineering control theory, 
and the corresponding control process of mathematical economics. 

The advantage of recognizing these analogies lies in the possi- 
bility of borrowing various computational and theoretical tech- 
niques that have proved of value in these fields which have been 
exposed to mathematical study for a longer time and in a more in- 
tensive fashion. 

Large Scale Mathematical Models and Computers. It is impor- 
tant to emphasize the fact that mathematical models of the fore- 
going type are not designed solely for analytic purposes. By this 
we mean that it is not to be expected that realistic models of any 




complexity can be treated purely by hand techniques involving ex- 
act solutions in terms of known functions. In formulating these 
models we are thinking in terms of the computational solution by 
means of modern high speed computers, either digital or analogue 
devices, depending upon the types of equations we encounter. 

Once we have clearly stated this objective, we emancipate our- 
selves from all sorts of bonds such as linearity, stationarity, and 
so on, which effectively prevent us from coming to grips with the 
actual processes. 

It is also important that we recognize the fact that modern ma- 
chines can be used not only to resolve explicit analytic equations 
describing various systems, but also to enable us to utilize simu- 
lation techniques to study the operation of still more complex 
systems. 

We feel that the advent of the modern computing device will per- 
mit us to approach for the first time all types of biological prob- 
lems which, because of their complexity and sheer magnitude, have 
previously been only sadly surveyed from a distance. 


LITERATURE 


Bellman, R., J. A. Jacquez, and R. Kalaba. 1960. "Some Mathematical
Aspects of Chemotherapy—I: One Organ Models." Bull. Math. Bio-
physics, 22, 181-198.

Morales, M. F., and R. E. Smith. 1945. "A Note on the Physiological
Arrangement of Tissues." Bull. Math. Biophysics, 7, 47-51.

Morales, M. F., and R. E. Smith. 1948. "On the Theory of Blood Tissue
Exchange of Inert Gases, VI." Bull. Math. Biophysics, 10, 191-200.

Smith, R. E., and M. F. Morales. 1944. "On the Theory of Blood Tissue
Exchanges, I." Bull. Math. Biophysics, 6, 125-131.

Teorell, T. 1937. "Kinetics of Distribution of Substance Administered
to the Body." Arch. Intern. de Pharmacodyn. et de Therap., 57,
205-240.

Von Schrotter, H. 1906. "Der Sauerstoff in der Prophylaxe und Therapie
der Luftdruckerkrankungen." Handbuch der Sauerstofftherapie, M.
Michaelis, Berlin.


RECEIVED 7-23-59 


